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TITLE OF INVENTION 

CHLAMYDIA ANTIGENS AND CORRESPONDING DNA FRAGMENTS AND USES 

THEREOF 
FIELD OF INVENTION 
The present invention relates to Chlamydia antigens and corresponding DNA 
molecules, which can be used in methods to prevent and treat Chlamydia infection in mammals, 
such as humans. 

BACKGROUND OF THE INVENTION 

Chlamydiae are prokaryotes. They exhibit morphologic and structural similarities to 
gram-negative bacteria including a trilaminar outer membrane, which contains 
lipopolysaccharide and several membrane proteins that are structurally and functionally 
analogous to proteins found in E coli. They are obligate intra-cellular parasites with a unique 
biphasic life cycle consisting of a metabolically inactive but infectious extracellular stage and a 
replicating but non-infectious intracellular stage. The replicative stage of the life-cycle takes 
place within a membrane-bound inclusion which sequesters the bacteria away from the 
cytoplasm of the infected host cell. . . _ _ 

C pneumoniae is a common human pathogen, originally described as the TWAR strain 
of Chlamydia psittaci but subsequently recognised to be a new species. C. pneumoniae is 

trachomatis, C pecorum and G psittaci). It shows 10% or less DNA sequence homology with 
either of C. trachomatis or Cpsittaci and so far appears to consist of only a single strain, TWAR. 

C pneumoniae is a common cause of community acquired pneumonia, only less frequent 
than Streptococcus pneumoniae and Mycoplasma pneumoniae (Ref 1,2). It can also cause upper 
respiratory tract symptoms and disease, including bronchitis and sinusitis (Ref 1,3,4,5). The 
great majority of the adult population (over 60%) has antibodies to C pneumoniae (Ref 5), 
indicating past infection which was unrecognized or asymptomatic. 

Of considerable importance is the association of atherosclerosis and C pneumoniae 
infection. There are several epidemiological studies showing a correlation of previous infections 
with G pneumoniae and heart attacks, coronary artery and carotid artery disease (Ref 6-10). 
Moreover, the organisms has been detected in atheromas and fatty streaks of the coronary, 
carotid, peripheral arteries and aorta (Ref 11-15). Viable C pneumoniae has been recovered 
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from the coronary and carotid artery (Ref 16,17). Furthermore, it has been shown that C. 
pneumoniae can induce changes of atherosclerosis in a rabbit model (Ref 18). Taken together, 
these results indicate that it is highly probable that C. pneumoniae can cause atherosclerosis in 
humans, though the epidemiological importance of chlamydial atherosclerosis remains to be 
demonstrated. . * 

A number^of tffecent#slMies**have*als©M^ pneumoniae 
infection and asthmas* Infection has*been4iiikedvto^wheezm^ 
asthma and acute exacerbati'om-of asthrha**in adults?*and*smalfcsGale*sn^^ 
prolonged antibiotic treatment was effective at greatly reducing the severity of the disease in 
some individuals (Ref 1 9-24) . 

In light of these results a protective vaccine against C pneumoniae infection would be of 
considerable importance. There is not yet an effective vaccine for any human chlamydial 
infection. Nevertheless, studies with C. trachomatis and C psittaci indicate that this is an 
attainable goal. For example, mice which have recovered from a lung infection with C 
trachomatis^^ proteeted*from*infe^ 

Similarly, sheep immunized with inactivated* C psittaei* were protected%,from subsequent 
chlamydial-indjuc^. abortion has 
been associated with^Thl^immune^ producing 

conferred proteetk>n**from- challengejfor cleared^chrbnic disease^(RfeT^8^9^and >> in *vivo 
depletion of CD4+ T cells- exacerbated^disease postr.challenge, v (Ref r3 However^ the 

presence of sufficiently high titres of neutralising antibody at mucosal surfaces can also exert a 
protective effect (Ref 32). 

The extent of antigenic variation within the species C. pneumoniae is not well 
characterised. Serovars of C trachomatis are defined on the basis of antigenic variation in 
MOMP, but published C pneumoniae MOMP gene sequences show no variation between 
several diverse isolates of the organism (Ref 33-35). Regions of the protein known to be 
conserved in other chlamydiaLIyfOMPs are conserved in. C^pneumonide«(Rsf*3 3, 34)- One study 
has described a strain of C. pneumoniae&vnih a* M©MI&if greate^^ 
but the gene for this- has notlbeen^sequenced .(ReM$l %Parti#*sequenees*qf:rd^ 
protein 2 from nine diverse^solatesAw^ 
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HSP60 and HSP70 show little variation from other chlamydial species, as would be expected. 
The gene encoding a 76kDa antigen has been cloned from a single strain of C. pneumoniae. It 
has no significant similarity with other known chlamydial genes (Ref 4). 

Many antigens recognised by immune sera to C pneumoniae are conserved across all 
5 chlamydiae, but 98kDa, 76 kDa and 54 kDa proteins may be C. pneumoniae-specific (Ref 2, 4, 
36). Immunoblotting of isolates with sera from patients does show variation of blotting patterns 
between isolates, indicating that serotypes G pneumoniae may exist (Ref 1,16). However, the 
results are potentially confounded by the infection status of the patients, since immunoblot 
profiles of a patient's sera change with time post-infection. An assessment of the number and 
10 relative frequency of any serotypes, and the defining antigens, is not yet possible. 

C. pneumoniae infection usually presents as an acute respiratory disease (i.e., cough, sore 
throat, hoarseness, and fever; abnormal chest sounds on auscultation). For most patients, the 
^ cough persists for 2 to 6 weeks, and recovery is slow. In approximately 10% of these cases, 
O upper respiratory tract infection is followed by bronchitis or pneumonia. Furthermore, during a 
gg5 C pneumoniae epidemic, subsequent co-infection with pneumococcus has been noted in about 
jjj half of these^ pneumonia patients, particularly in the. infirm and the elderly. -As noted above, 
Mf there is more and more evidence that C pneumoniae infection is also linked to diseases other 
s than respiratory infections. 
l± — Tfre^servoi r^ o r-t he-^ 

there is no known bird or animal reservoir. Transmission has not been clearly defined. It may 
=0 result from direct contact with secretions, from formites, or from airborne spread. There is a long 
incubation period, which may last for many months. Based on analysis of epidemics, C. 
pneumoniae appears to spread slowly through a population (case-to-case interval averaging 30 
days) because infected persons are inefficient transmitters of the organism. Susceptibility to C. 
25 pneumoniae is universal. Reinfections occur during adulthood, following the primary infection as 
a child. G pneumoniae appears to be an endemic disease throughout the world, noteworthy for 
superimposed intervals of increased incidence (epidemics) that persist for 2 to 3 years. 
C trachomatis infection does not confer cross-immunity to G pneumoniae. Infections are easily 
treated with oral antibiotics, tetracycline or erythromycin (2 g/d, for at least 10 to 14 d). A 
30 recently developed drug, azithromycin, is highly effective as a single-dose therapy against 
chlamydial infections. 
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In most instances, C pneumoniae infection is often mild and without complications, and 
up to 90% of infections are subacute or unrecognized. Among children in industrialized 
countries, infections have been thought to be rare up to the age of 5 y, although a recent study (E 
Normann et al, Chlamydia pneumoniae in children with acute respiratory tract infections, Acta 
5 Paediatrica, 1998, Vol 87, Iss 1, pp 23-27) has reported that many children ih:this age group 
show PGR evidence: of infection despite being seronegative- and estimates^ prevalence of 17- 
19% in 2-4 y olds; In developing countries, the seroprevalence of C. pneumoniae antibodies 
among young children is elevated, and there -are suspicions that C. pneumoniae may be an 
important cause of acute lower respiratory tract disease and mortality for infants and children in 
10 tropical regions of the world. 

From seroprevalence studies and studies of local epidemics, the initial C. pneumoniae 
infection usually happens between the ages of 5 and 20 y. In the USA, for example, there are 
„. estimated to be 30,000 cases of childhood pneumonia each year caused by C. pneumoniae. 
C3 Infections may cluster among groups of children or young adults (e.g., school pupils or military 
m conscripts). 

Tfi C. pneumoniae causes 10 to 25%.of con^vmity-acquired jower respiratory tract infections 

^ (as reported from Sweden^Italy, Finland, and the USA). During- amepidemic, C pneumonia 
5 infection may account for 50 to 60% of the eases*of pneum also; more 

ip i s ode s of mixedM n fectioas witfcS. pn eu mom ae =h&m beeireeported* , f 

Wp Reinfection during, adulthoodrfs com Based on 

S population seroprevalencet.studies$^there; tends to be increasedi»exposxire with, age, which is 
particularly evident among men. Some investigators have speculated that a persistent, 
asymptomatic C. pneumoniae infection state is common. 

In adults of middle age or older, C pneumoniae infection may progress to chronic 
25 bronchitis and sinusitis. A study in the USA revealed that the incidence of pneumonia caused by 
C pneumoniae in persons younger than 60 years is 1 case per 1,000 persons per year; but in the 
elderly, the disease incidence rose three-fold. C. pneumoniae infection rarely leads to 
hospitalization, except in patients^with an underlying jllness. 

SUMMARY QFWTHE JN^ENTION^ v 
30 The present* inventionfcprovide^ p that encode 

Chlamydia polypeptid'es^designated^ CPN^OOSSO^SE^ ID No^l ,2), whiehscan be used in 



RY47 101998 



10 



ffc5 



So 

i il 



25 



methods to prevent, treat, and diagnose Chlamydia infection. The encoded polypeptides include 
polypeptides having the amino acid sequence shown in SEQ ID No:3 and 4. Those skilled in the 
art will appreciate that the invention also includes DNA molecules that encode mutants and 
derivatives of such polypeptides, which result from the addition, deletion, or substitution of non- 
essential amino acids as described herein. The invention also includes RNA molecules 
corresponding to the DNA molecules of the invention. 

In addition to the DNA and RNA molecules, the invention includes the corresponding 
polypeptides and monospecific antibodies that specifically bind to such polypeptides. 

The present invention has wide application and includes expression cassettes, vectors, 
and cells transformed or transfected with the polynucleotides of the invention. Accordingly, the 
present invention provides (i) a method for producing a polypeptide of the invention in a 
recombinant host system and related expression cassettes, vectors, and transformed or transfected 
cells; (ii) a live vaccine vector, such as a pox virus, Salmonella typhimurium, or Vibrio cholerae 
vector, containing a polynucleotide of the invention, such vaccine vectors being useful for, e,g, y 
preventing and treating Chlamydia infection, in combination with a diluent or carrier, and related 
pharmaceutical .compositions and associated- therapeutic and/or prophylactic methods; (iii) a 
therapeutic and/or prophylactic method involving administration of an RNA or DNA molecule of 
the invention, either in a naked form or formulated with a delivery vehicle, a polypeptide or 
-combination of polypeptid e s, or - a — monospecific antibody of lhe~- mv e ntion, and related— 



30 



pharmaceutical compositions; (iv) a method for diagnosing the presence of Chlamydia in a 
biological sample, which can involve the use of a DNA or RNA molecule, a monospecific 
antibody, or a polypeptide of the invention; and (v) a method for purifying a polypeptide of the 
invention by antibody-based affinity chromatography. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The present invention will be further understood from the following description with 
reference to the drawings, in which: 

Figure 1 shows the nucleotide sequence of the CPN1 00630 (SEQ ID No: 1 - entire 
sequence and SEQ ID No: 2 - coding sequence) and the deduced amino acid sequence of the 
CPN100630 protein from Chlamydia pneumoniae (SEQ ID No: 3 - full length and 4 - 
processed). 
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Figure 2 shows the restriction enzyme analysis of the gene encoding the C. 
pneumoniae CPN100630 gene. 

DETAILED DESCRIPTION OF INVENTION 

In the C. pneumoniae genome, open reading frames (ORFs) encoding chlamydial 
polypeptides4Lave*been*identifie$^ 
in the bacteriafcmembrane*stracture^ 

bacterial ^membrane^ ^include^o^ the^inelusioni*meinbraneK 
structure, polypeptides that are present dn the^extemaI^yichrity v of*th€S nnclusiommembrane, and 
polypeptides that are released into the cytoplasm of the infected cell. These polypeptides can be 
used in vaccination methods for preventing and treating Chlamydia infection. 

According to a first aspect of the invention, there are provided isolated 
polynucleotides encoding the precursor and mature forms of Chlamydia polypeptides. 

An isolated polynucleotide of the invention encodes (i) a polypeptide having an 
amino acid sequence that is homologous to a Chlamydia amino acid, the Chlamydia amino acid 
sequence being seleeted#om4he*group eonsisting^pf: 

(a) the .amino acid sequences as-shown: .(SEQ ID No~»3 and-4) 

The^teim*^isolate^ define^as aipoiynueleQtiderfremp^ed*»from the 

environments whiehcat -natup^ly w $ec^ natu^^ly^pccurFing^gNA molecule 

-present in the genome of^4iving bacteria^r^s^artx^fa, gene bank is noHsoteted^but th c ^ 5^ne= 



So molecule separatedfcfromrthe*remaining#^ a result*df^e. ( g?^a cloning 

event (amplification), is isolated; * Typically, an isolated DNA molecule is free* from DNA 
regions (e.g., coding regions) with which it is immediately contiguous at the 5' or 3' end, in the 
naturally occurring genome. Such isolated polynucleotides could be part of a vector or a 
composition and still be isolated in that such a vector or composition is not part of its natural 

25 environment. 

A polynucleotide of the invention can be in the form of RNA or DNA (e.g. y cDNA, 
genomic DNA, or synthetic DNA), or modifications or combinations thereof. The DNA can be 
double-stranded or single-stranded^ and, *if single^stranded^ean^be the ^ding^sfrandi^the non-: 
coding (anti-sense) strand. The sequence that«encodes:a<ipolypeptide*0f the; 4nventic>nias shown in 
30 SEQ ID NOs: 1 and 2, can be (a) the Coding ^equenge as shown^in -SEQ ID .NOs:2%,(b) a 
ribonucleotide sequence derived by transcription of (a) ; or (c) a different coding ^sequence; this 
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latter, as a result of the redundancy or degeneracy of the genetic code, encodes the same 
polypeptides as the DNA molecules of which the nucleotide sequences are illustrated in SEQ ID 
NOs:l to 2. 

By "polypeptide" or "protein" is meant any chain of amino acids, regardless of length 
or post-translational modification (e.g., glycosylation or phosphorylation). Both terms are used 
interchangeably in the present application. 

By "homologous amino acid sequence" is meant an amino acid sequence that differs 
from an amino acid sequence shown in SEQ ID No: 3 or 4, only by one or more conservative 
amino acid substitutions, or by one or more non-conservative amino acid substitutions, deletions, 
or additions located at positions at which they do not destroy the specific antigenicity of the 
polypeptide. 

Preferably, such a sequence is at least 75%, more preferably 80%, and most 
preferably 90% identical to an amino acid sequence shown in SEQ ID No: 3 or 4. 

Homologous amino acid sequences include sequences that are identical or 
substantially identical to an amino acid sequence as shown in SEQ ID No:3 or 4. By "amino 
acid sequence- substantially identical" is meant a sequence. that is at least 90%, preferably 95%, 
more preferably 97%, and most preferably 99% identical to an amino acid sequence of reference 
and that preferably differs from the sequence of reference, if at all, by a majority of conservative 
amino acid substitutions. 

Conservative amino acid substitutions typically include substitutions among amino 
acids of the same class. These classes include, for example, amino acids having uncharged polar 
side chains, such as asparagine, glutamine, serine, threonine, and tyrosine; amino acids having 
basic side chains, such as lysine, arginine, and histidine; amino acids having acidic side chains, 
such as aspartic acid and glutamic acid; and amino acids having nonpolar side chains, such as 
glycine, alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan, and 
cysteine. 

Homology is typically measured using sequence analysis software (e.g., Sequence 
Analysis Software Package of the Genetics Computer Group, University of Wisconsin 
Biotechnology Center, 1710 University Avenue, Madison, WI 53705). Similar amino acid 
sequences are aligned to obtain the maximum degree of homology (z.e., identity). To this end, it 
may be necessary to artificially introduce gaps into the sequence. Once the optimal alignment 
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has been set up, the degree of homology {i.e., identity) is established by recording all of the 
positions in which the amino acids of both sequences are identical, relative to the total number of 
positions. 

Homologous polynucleotide sequences are defined in a similar way. Preferably, a 
5 homologous sequence is one that is at least~45%, more preferably 60%^and most preferably 85% 
identical to (i) a coding sequence^of SEQJD NOs: l and 2. 

Polypeptides having^a sequence 1 homologous to one of the* sequenees^shown in SEQ 
ID NO: 3 or 4 , include naturally-occurring allelic variants, as well as mutants-or any other non- 
naturally occurring variants that are analogous in terms of antigenicity, to a polypeptide having a 
10 sequence as shown in SEQ ID NO: 3 or 4. 

As is known in the art, an allelic variant is an alternate form of a polypeptide that is 
characterized as having a substitution, deletion, or addition of one or more amino acids that does 
^ not alter the biological function of the polypeptide. By "biological function" is meant the 
□ function of the polypeptide in the cells in which it naturally occurs, even if the function is not 
Ss necessary for the growth or-survival" of the cells. For example, the biological-function of a porin 
fl . is to allow the entry into cells of compounds present^in the extracellular m^ 

W function is distinct from the antigenic function: A polypeptide canvhave more than one 

S3 

5 biological function:^ r 

-U Allelic v ariants^exvei y^^ 

jjfO pneumoniae, is usuallyerepreseE^dvby.a^ minor 

iyp allelic variations. Indeedf^a polypeptide that fulfills the~*same biological function in different 

83 

strains can have an amino acid sequence that is not identical in each of the strains. Such an 

allelic variation may be equally reflected at the polynucleotide level. 

Support for the use of allelic variants of polypeptide antigens comes from, e.g., 
25 studies of the Chlamydial MOMP antigen. The amino acid sequence of the MOMP varies from 

strain to strain, yet cross-strain antibody binding plus neutralization of infectivity occurs, 

indicating that the MOMP, when used as an immunogen, is tolerant of amino acid variations. 

Polynucleotides, we. g., DNA molecules, _ encoding^llelic variants*. can. .easily be 

retrieved by polymerase chain reaction (PGR) amplification of genomicibaeteriakDNA extracted > 
30 by conventional methdds.;^Th&m^ matching 

upstream and do wnstreamsof ^tllfebS* Md#V ends«6f thete^ be 

8 
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designed according to the nucleotide sequence information provided in SEQ ID NOs:l and 2. 
Typically, a primer can consist of 10 to 40, preferably 15 to 25 nucleotides. It may be also 
advantageous to select primers containing C and G nucleotides in a proportion sufficient to 
ensure efficient hybridization; e.g., an amount of C and G nucleotides of at least 40%, preferably 
50% of the total nucleotide amount. 

Useful homologs that do not naturally occur can be designed using known methods 
for identifying regions of an antigen that are likely to be tolerant of amino acid sequence changes 
and/or deletions. For example, sequences of the antigen from different species can be compared 
to identify conserved sequences. 

Polypeptide derivatives that are encoded by polynucleotides of the invention include, 
e.g., fragments, polypeptides having large internal deletions derived from full-length 
polypeptides, and fusion proteins. 

Polypeptide fragments of the invention can be derived from a polypeptide having a 
sequence homologous to any of the sequences shown in SEQ ID NO: 3 or 4 , to the extent that 
the fragments retain the substantial antigenicity of the parent polypeptide (specific antigenicity). 
Polypeptide derivatives can also be constructed by large internal deletions that remove a 
substantial part of the parent polypeptide, while retaining specific antigenicity. Generally, 
polypeptide derivatives should be about at least 12 amino acids in length to maintain 

acids, more preferably at least 75 amino acids, and most preferably at least 100 amino acids in 
length. 

Useful polypeptide derivatives, e.g., polypeptide fragments, can be designed using 
computer-assisted analysis of amino acid sequences in order to identify sites in protein antigens 
having potential as surface-exposed, antigenic regions (Ref 37). 

Polypeptide fragments and polypeptides having large internal deletions can be used 
for revealing epitopes that are otherwise masked in the parent polypeptide and that may be of 
importance for inducing a protective T cell-dependent immune response. Deletions can also 
remove immunodominant regions of high variability among strains. 

It is an accepted practice in the field of immunology to use fragments and variants of 
protein immunogens as vaccines, as all that is required to induce an immune response to a 
protein is a small (e.g., 8 to 10 amino acid) immunogenic region of the protein. This has been 
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done for a number of vaccines against pathogens other than Chlamydia. For example, short 
synthetic peptides corresponding to surface-exposed antigens of pathogens such as murine 
mammary tumor virus, peptide containing 1 1 amino acids; (Ref 38), Semliki Forest virus, 
peptide containing 16 amino acids (Ref 39), and canine parvovirus, 2 overlapping peptides, each 
5 containing 1 5 aminp^aeids*»(Ref 40)? haveabeenashojvvnvto, be> effeetiveMvacGinelfentigens^against 
their resp§ptive*pathagens. 

Polynueleotddes encoding polypeptide fragments* *and poLypeptidesifchaving-large 
internal deIetions*ean(?Jbe gonst^u^d <msing m standai*t methods*(Ref*41)* for^example, by PCR, 
including inverse PCR, by restriction enzyme treatment of the cloned DNA molecules, or by the 
10 method of Kunkel et al. (Ref 42) biological material available at Stratagene. 

A polypeptide derivative can also be produced as a fusion polypeptide that contains a 
polypeptide or a polypeptide derivative of the invention fused, e.g., at the N- or C-terminal end, 
^ to any other polypeptide (hereinafter referred to as a peptide tail). Such a product can be easily 
O obtained by translation of a genetic fusion, i.e. 9 a hybrid gene. Vectors for expressing fusion 
jj5 polypeptides*- are**commereiadly* availafele$wsuelr as ^the*pMaltc2^or pMakp2 ^systems* of New 
iji England Biolabs, in which the. peptide -taiLus a maltose- binding^prptein^,the#glutathione-S- 
JjJ transferase ^system of,Pharmacia,-or the^His-Tag*s#ste^ These and 

= other expression^system$*pi;ovidetfcon^ and 
^ deri vati ves^f^he^nventfen?^ — 
So Another*p'ai&eular^ includes a 

polypeptide *or polypeptideVderivfatdve*of the^ invention*fused*tg a pol^peptide**having adjuvant 
activity, such as, e.g., subunit B of either cholera toxin or £ coli heat-labile toxin. Several 
possibilities are can be used for achieving fusion. First, the polypeptide of the invention can be 
fused to the N-, or preferably, to the C-terminal end of the polypeptide having adjuvant activity. 
25 Second, a polypeptide fragment of the invention can be fused within the amino acid sequence of 
the polypeptide having adjuvant activity. 

As stated above, the polynucleotides of the invention encode Chlamydia polypeptides 
in precursor or mature, form.. ^.They can also^encode %brid*pr£C.ursors*eon^ 
signal peptides, which can; mature*:ihto*pol^ signal 
30 peptide" is meant a signd#peptidie¥thattis not ifdutidjin tthe^natui^^ a 
polypeptide of the invention* * 
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A polynucleotide of the invention, having a homologous coding sequence, hybridizes, 
preferably under stringent conditions, to a polynucleotide having a sequence as shown in SEQ ID 
NOs:l to 2. Hybridization procedures are, e.g., described in Ausubel et ah, (Ref 41), Silhavy et 
ah (Ref 43); Davis et ah (ref 44). Important parameters that can be considered for optimizing 
hybridization conditions are reflected in a formula that allows calculation of a critical value, the 
melting temperature above which two complementary DNA strands separate from each other Ref 
45). This formula is as follows: Tm = 81.5 + 0.5 x (% G+C) +1.6 log (positive ion 
concentration) - 0.6 x (% fonnamide). Under appropriate stringency conditions, hybridization 
temperature (Th) is approximately 20 to 40°C, 20 to 25°C, or, preferably 30 to 40°C below the 
calculated Tm. Those skilled in the art will understand that optimal temperature and salt 
conditions can be readily determined empirically in preliminary experiments using conventional 
procedures. 

For example, stringent conditions can be achieved, both for pre-hybridizing and 
hybridizing incubations, (i) within 4-16 hours at 42°C, in 6 x SSC containing 50% formamide or 
(ii) within 4-16 hours at 65°C in an aqueous 6 x SSC solution (1 M NaCl, 0.1 M sodium citrate 
(pH 7.0)). . . 

For polynucleotides containing 30 to 600 nucleotides, the above formula is used and 
then is corrected by subtracting (600/polynucleotide size in base pairs). Stringency conditions 
^are defined Jby a-Th^that is 5 to-l^^€4)elow Tm. 

Hybridization conditions with oligonucleotides shorter than 20-30 bases do not 
exactly follow the rules set forth above. In such cases, the formula for calculating the Tm is as 
follows: Tm = 4x (G+C) + 2 (A+T). For example, an 18 nucleotide fragment of 50% G+C 
would have an approximate Tm of 54°C. 

A polynucleotide molecule of the invention, containing RNA, DNA, or modifications 
or combinations thereof, can have various applications. For example, a DNA molecule can be 
used (i) in a process for producing the encoded polypeptide in a recombinant host system, (ii) in 
the construction of vaccine vectors such as poxviruses, which are further used in methods and 
compositions for preventing and/or treating Chlamydia infection, (iii) as a vaccine agent (as well 
as an RNA molecule), in a naked form or formulated with a delivery vehicle and, (iv) in the 
construction of attenuated Chlamydia strains that can over-express a polynucleotide of the 
invention or express it in a non-toxic, mutated form. 
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According to a second aspect of the invention, there is therefore provided (i) an 
expression cassette containing a DNA molecule of the invention placed under the control of the 
elements required for expression, in particular under the control of an appropriate promoter; (ii) 
an expression vector containing an expression cassette of the invention; (iii) a procaryotic or 
5 eucaryotic cell transfoimed - o^ of the 

invention, as well as (iv) a process for producing a polypeptide or polypeptide derivative- 
encoded by a polynucleotide of the invention, which involves culturing a procaryotic or 
eucaryotic cell transformed-or transfected with an expression cassette and/or vector of the 
invention, under conditions that allow expression of the DNA molecule of the invention and, 
10 recovering the encoded polypeptide or polypeptide derivative from the cell culture. 

A recombinant expression system can be selected from procaryotic and eucaryotic 
hosts. Eucaryotic hosts include yeast cells (e.g., Saccharomyces cerevisiae or Pichia pastoris), 
mammalian cells (e.g., COS1, NIH3T3, or JEG3 cells), arthropods cells {e.g., Spodoptera 
G frugiperda (SF9) cells), and plant cells. Preferably, a procaryotic host such as E. coli is used. 
05 Bacterial and eucaryoticxellsjare available from a number^of differentosourees to those skilled in 
Tj, the art, e.g., the American Type Culture Collection (ATCC; RockviUei? Maryland). 
H Thexhoice of theexpression system depends on the features desired-fpr the expressed 

s polypeptide.^ For example$$it: rnay^ber useful to producesa polypeptide of >the#invention in a 
t*- particular-lipidated^or^^ 

3(o The choice ofcthetexpressioi^ well 

Jfj as the features desired for the expressedpolypeptide. Typically, an expression cassette includes 

Mi? 

a promoter that is functional in the selected host system and can be constitutive or inducible; a 
ribosome binding site; a start codon (ATG) if necessary, a region encoding a signal peptide, e.g., 
a lipidation signal peptide; a DNA molecule of the invention; a stop codon; and optionally a 3' 

25 terminal region (translation and/or transcription terminator). The signal peptide encoding region 
is adjacent to the polynucleotide of the invention and placed in proper reading frame. The signal 
peptide-encoding region can be homologous or heterologous to the DNA molecule encoding the 
mature polypeptide and can, be specificto the secretion<apparatusso£the hosfeusedtfor expression. 
The open reading frame constituted-: by the DNA molecule of the invention, -solely or together - 

30 with the signaLpeptide;Js placed under the^control: of the promoter* so , that transcription and 
translation occur in the host systems ' Promoters^- signal peptide encoding regions are widely 
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known and available to those skilled in the art and includes, for example, the promoter of 
Salmonella typhimurium (and derivatives) that is inducible by arabinose (promoter araB) and is 
functional in Gram-negative bacteria such as E, coli (as described in U.S. Patent No. 5,028,530 
and in Cagnon et al. 9 (Ref 46); the promoter of the gene of bacteriophage T7 encoding RNA 
polymerase, that is functional in a number of E. coli strains expressing T7 polymerase (described 
in U.S. Patent No. 4,952,496); OspA Iipidation signal peptide ; and RIpB lipidation signal 
peptide (Ref 47). 

The expression cassette is typically part of an expression vector, which is selected for 
its ability to replicate in the chosen expression system. Expression vectors (e.g., plasmids or 
viral vectors) can be chosen from those described in Pouwels et al. (Cloning Vectors: A 
Laboratory Manual 1985, Supp. 1987). They can be purchased from various commercial 
sources. 

Methods for transforming/transfecting host cells with expression vectors will depend 
on the host system selected as described in Ausubel et aL, (Ref 41). 

Upon expression, a recombinant polypeptide of the invention (or a polypeptide 
derivative) is produced and remains in the intracellular compartment, is secreted/excreted in the 
extracellular medium or in the periplasmic space, or is embedded in the cellular membrane. The 
polypeptide can then be recovered in a substantially purified form from the cell extract or from 

polypeptide can be purified by antibody-based affinity purification or by any other method that 
can be readily adapted by a person skilled in the art, such as by genetic fusion to a small affinity 
binding domain. Antibody-based affinity purification methods are also available for purifying a 
polypeptide of the invention extracted from a Chlamydia strain. Antibodies useful for purifying 
by immunoaffinity the polypeptides of the invention can be obtained as described below. 

A polynucleotide of the invention can also be useful in the vaccine field, e.g., for 
achieving DNA vaccination. There are two major possibilities, either using a viral or bacterial 
host as gene delivery vehicle (live vaccine vector) or administering the gene in a free form, e.g., 
inserted into a plasmid. Therapeutic or prophylactic efficacy of a polynucleotide of the invention 
can be evaluated as described below. 

Accordingly, in a third aspect of the invention, there is provided (i) a vaccine vector 
such as a poxvirus, containing a DNA molecule of the invention, placed under the control of 
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elements required for expression; (ii) a composition of matter containing a vaccine vector of the 
invention, together with a diluent or carrier; particularly, (iii) a pharmaceutical composition 
containing a therapeutically or prophylactically effective amount of a vaccine vector of the 
invention; (iv) a method for inducing an immune response against Chlamydia in a mammal (e.g., 
5 a human; alternatively, the method can, be used in veterinary, applications ^for treating or 
preventing Chlamydia infection of animals^ e.g., cats or birds), which involves^administering to 
the mammal an immunogenically effective amount of a vaccine vector of the invention to elicit 
an immune response, e.g., a protective or therapeutic immune^ response to* Chlamydia ; and 
particularly, (v) a method for preventing and/or treating a Chlamydia (e.g., C. trachomatis, C. 
10 psittaci, C. pneumonia, C. pecorum) infection, which involves administering a prophylactic or 
therapeutic amount of a vaccine vector of the invention to an individual in need. Additionally, 
the third aspect of the invention encompasses the use of a vaccine vector of the invention in the 

?K preparation of a medicament for preventing and/or treating Chlamydia infection. 

p A vaccine vector of the invention can express one or several polypeptides or 

Jgs derivatives of the- 'invention, as well as at least one additional Chlamydia antigen, fragment, 
homolog, mutant, or derivative thereof. In addition, it can express a cytokine, such as 
interleukin-2 (IL-2) or interleukin-12 (IL- 1 2), that* enhances the ; 4mmune v response (adjuvant 

f effect). Thus, a vaccineS^ctoncan^ a 

1 4- ehlamydial-^tigenrr^r^cytoki^ 



SSo in a mammalian cell. 

22 Alternatively, a composition of the invention can, include several yaccine vectors, 

each of them being capable of expressing a polypeptide or derivative of the invention. A 
composition can also contain a vaccine vector capable of expressing an additional Chlamydia 
antigen, or a subunit, fragment, homolog, mutant, or derivative thereof; or a cytokine such as IL- 

25 2 orIL-12. 

In vaccination methods for treating or preventing infection in a mammal, a vaccine 
vector of the invention can be administered by any conventional route in use in the vaccine field, 
particularly, to a mucosal (e.g., ocular, intranasal; .oral, , gastric, pulmonary,, intestinal, rectal, 
vaginal, or urinary tract) surface or via the parenteral (e.g., .subcutaneous,; •; intradermal, 
30 intramuscular, intravenous, or intraperitoneal) route. Preferred routes depend upon the choice of 
the vaccine vector. The administration can^be achieved^in a single dose* or repeated*at intervals. 

14 
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The appropriate dosage depends on various parameters understood by skilled artisans such as the 
vaccine vector itself, the route of administration or the condition of the mammal to be vaccinated 
(weight, age and the like). 

Live vaccine vectors available in the art include viral vectors such as adenoviruses 
and poxviruses as well as bacterial vectors, e.g., Shigella, Salmonella, Vibrio cholerae, 
Lactobacillus, Bacille bilie de Calmette-Guerin (BCG), and Streptococcus. 

An example of an adenovirus vector, as well as a method for constructing an 
adenovirus vector capable of expressing a DNA molecule of the invention, are described in 
U.S. Patent No. 4,920,209. Poxvirus vectors that can be used include, e.g., vaccinia and canary 
pox virus, described in U.S. Patent No. 4,722,848 and U.S. Patent No. 5,364,773, respectively 
(also see, e.g. y Tartaglia et aL, Virology (1992) 188:217) for a description of a vaccinia virus 
vector; and Taylor et al, Vaccine (1995) 13:539 for a reference of a canary pox). Poxvirus 
vectors capable of expressing a polynucleotide of the invention can be obtained by homologous 
recombination as described in Kieny et al., Nature (1984) 312:163 so that the polynucleotide of 
the invention is inserted in the viral genome under appropriate conditions for expression in 
mammalian cells.^Generally , the_dose_of _vaccine_viral jvector, for_therapeutic_or„prophylactic use, 
can be of from about IxlO 4 to about lxlO 11 , advantageously from about lxlO 7 to about IxlO 10 , 
preferably of from about lxlO 7 to about lxlO 9 plaque-forming units per kilogram. Preferably, 



%ip in the art recognize that it is preferable to avoid adding a chemical adjuvant to a composition 
"^3 containing a viral vector of the invention and thereby minimizing the immune response to the 
viral vector itself. 

Non-toxicogenic Vibrio cholerae mutant strains that are useful as a live oral vaccine 
are described in Mekalanos et al. y Nature (1983) 306:551 and U.S. Patent No. 4,882,278 (strain 

25 in which a substantial amount of the coding sequence of each of the two ctxA alleles has been 
deleted so that no functional cholerae toxin is produced); WO 92/1 1354 (strain in which the irgA 
locus is inactivated by mutation; this mutation can be combined in a single strain with ctxA 
mutations); and WO 94/1533 (deletion mutant lacking functional ctxA and attRSl DNA 
sequences). These strains can be genetically engineered to express heterologous antigens, as 

30 described in WO 94/19482. An effective vaccine dose of a Vibrio cholerae strain capable of 
expressing a polypeptide or polypeptide derivative encoded by a DNA molecule of the invention 
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can contain, e.g., about 1x10 s to about lxlO 9 , preferably about lxlO 6 to about 1x10 s viable 
bacteria in an appropriate volume for the selected route of administration. Preferred routes of 
administration include all mucosal routes; most preferably, these vectors are administered 
intranasally or orally. 

expression of ^heterologous- antigens*or not, and their^usejtas ord^vaccines*arek*described in 
Nakayama et *<al: (Bid/SEechnolog^ (1988) 6:693) and** WO 92/1T361* Preferred routes of 
administration include all mucosal, routes? most preferably, these vectors are ^administered 
intranasally or orally. 

10 Others bacterial strains useful as vaccine vectors are described in High et aL, EMBO 

(1 992) 1 1 : 1 99 1 and Sizemore et aL, Science (1 995) 270:299 (Shigella Jlexneri); Medaglini et aL, 
Proc. Natl. Acad. Sci. USA (1995) 92:6868 (Streptococcus gordonii); and Flynn J.L., Cell. Mol. 
Biol. (1994) 40 (suppl. I):31, WO 88/6626, WO 90/0594, WO 91/13157, WO 92/1796, and WO 
£3 92/2 1 376 (Bacille Calmette Guerin). 

r*j5 In bacterial vectors^polynucleotide^of the 4nvention(»ean*be inserted into the bacterial 

fl genome or can remain in a freestate^carried on a plasmid;.* - 

An adjuMan^can*a}sp;ibe addedto a composition*eontainingta^aecine#bacterial sector. 
£ A number of adjuvantsme4cno,wn*to those *slrilil^*in*the*ai£^ selected 
I A from^tiie4ist provided-below.. 

;?p According to a*fourth*aspe.ct«iof the4nvention^there Js alsQ*provide!d« composition 

lU 

v8 of matter containing a polynucleotide of the invention, together with,a diluent or carrier (ii) a 

ih 

pharmaceutical composition containing a therapeutically or prophylactically effective amount of 
a polynucleotide of the invention; (iii) a method for inducing an immune response against 
Chlamydia, in a mammal, by administering to the mammal, an immunogenically effective 

25 amount of a polynucleotide of the invention to elicit an immune response, e.g., a protective 
immune response to Chlamydia; and particularly, (iv) a method for preventing and/or treating a 
Chlamydia (e.g., C trachomatis, C psittaci, C pneumoniae, or C pecorum) infection, by 
administering a prophylactic or therapeutical an - 

individual in need., AdditionalLy,*the ;fourth.-aspect*of the;>invention^ encQmpasses»the«use of a 

30 polynucleotide of the inventiontin w the^preparation*of a"medic^ent|for preyehtingimdjior^treating ^ . 
Chlamydia infection. The fourth aspect of the invention*preferably*includes the *use of a DNA 
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molecule placed under conditions for expression in a mammalian cell, e.g., in a plasmid that is 
unable to replicate in mammalian cells and to substantially integrate in a mammalian genome. 

Polynucleotides (DNA or RNA) of the invention can also be administered as such to a 
mammal for vaccine, e.g., therapeutic or prophylactic, purpose. When a DNA molecule of the 
invention is used, it can be in the form of a plasmid that is unable to replicate in a mammalian 
cell and unable to integrate in the mammalian genome. Typically, a DNA molecule is placed 
under the control of a promoter suitable for expression in a mammalian cell. The promoter can 
function ubiquitously or tissue-specifically. Examples of non-tissue specific promoters include 
the early Cytomegalovirus (CMV) promoter (described in U.S. Patent No. 4,168,062) and the 
Rous Sarcoma Virus promoter (described in Norton & Coffin, Moiec. Cell Biol. (1985) 5:281). 
The desmin promoter (Li et al, Gene (1989) 78:243, Li & Paulin, J. BioL Chem. (1991) 
266:6562 and Li & Paulin, J. Biol. Chem. (1993) 268:10403) is tissue-specific and drives 
expression in muscle cells. More generally, useful vectors are described, La. 9 WO 94/21797 and 
Hartikka et al., Human Gene Therapy (1996) 7:1205. 

For DNA/RNA vaccination, the polynucleotide of the invention can encode a 
precursor or a mature form. _When.it encodes a precursor form,-the -precursor-form can be 
homologous or heterologous. In the latter case, a eucaryotic leader sequence can be used, such as 
the leader sequence of the tissue-type plasminogen factor (tPA). 



J|f) invention. It can also contain at least one additional polynucleotide encoding another Chlamydia 

h3 antigen such as urease subunit A, B, or both; or a fragment, derivative, mutant, or analog thereof. 
A polynucleotide encoding a cytokine, such as interleukin-2 (IL-2) or interleukin-12 (IL-12), can 
also be added to the composition so that the immune response is enhanced. These additional 
polynucleotides are placed under appropriate control for expression. Advantageously, DNA 

25 molecules of the invention and/or additional DNA molecules to be included in the same 
composition, can be carried in the same plasmid. 

Standard techniques of molecular biology for preparing and purifying polynucleotides 
can be used in the preparation of polynucleotide therapeutics of the invention. For use as a 
vaccine, a polynucleotide of the invention can be formulated according to various methods. 

30 First, a polynucleotide can be used in a naked form, free of any delivery vehicles, 

such as anionic liposomes, cationic lipids, microparticles, e.g., gold microparticles, precipitating 
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agents, e.g., calcium phosphate, or any other transfection-facilitating agent. In this case, the 
polynucleotide can be simply diluted in a physiologically acceptable solution, such as sterile 
saline or sterile buffered saline, with or without a carrier. When present, the carrier preferably is 
isotonic, hypotonic, or weakly hypertonic, and has a relatively low ionic strength, such as 
provided by &sucrose>sMutio~£^^ 

Alternatively," a polynueleotide^can^be-assoeiated^wiftsa^ 
uptake. It can be, /.a, (i) complemented with a chemical* agent that^modifi^rs«ithevGellular^ 
permeability; such as bupivaeaine (see^e.g-., WO 94M6737), (ii)-eneapsulatedpintowliposomes, or 
(iii) associated with cationic lipids or silica, gold, or tungsten microparticles. 

Anionic and neutral liposomes are well-known in the art (see, e.g., Liposomes: A 
Practical Approach, RPC New Ed, IRL press (1990), for a detailed description of methods for 
making liposomes) and are useful for delivering a large range of products, including 
polynucleotides. 

Cationic lipids are also known in the art and are commonly used for gene delivery. 
Such lipids ineludeafcapo 

trimethylammonium * chloride), DOTAP . ( 1 ,2Tbis(oleylox^>3^(tiimeth^^ 

DDAB (dimethyldioGtadejeylamm 

and cholesterol derivatives suGh^as^DC^<Shfpli^(3 1b,eta^p^£@*l^ 
carbamoyi)=ehQlestei ^l). ^ A^escription-<)f^ese-cfrti^ 

WO 90/11 092,- JJ.S. ^Pateht^NowS,283^ Patent 
No. 5,527,928r Cationiedipids*fqr gei^ 

lipid such as DOPE (dioleyl phosphatidylethanolamine), as, for example, described in 
WO 90/11092. 

Other transfection-facilitating compounds can be added to a formulation containing 
cationic liposomes. A number of them are described in, e.g., WO 93/18759, WO 93/19768, WO 
94/25608, and WO 95/2397. They include, /.a., spermine derivatives useful for facilitating the 
transport of DNA through the nuclear membrane (see, for example, WO 93/1 8759) and 
membrane-permeabilizing^eompounds isuch^as ,GALA,^Granii'erdinewSj and«eatiohic bile-salts 
(see, for example, * WO 93'/ 19768):4* 

Gold ^or tungstemmieropaiti^ described in 

WO 91/359, WO 93/177015^^^ the - 
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microparticle-coated polynucleotides can be injected via intradermal or intraepidermal routes 
using a needleless injection device ("gene gun"), such as those described in U.S. Patent 
No. 4 5 945 3 050, U.S. Patent No. 5,015,580, and WO 94/24263. 

The amount of DNA to be used in a vaccine recipient depends, e.g., on the strength of 
the promoter used in the DNA construct, the immunogenicity of the expressed gene product, the 
condition of the mammal intended for administration (e.g., the weight, age, and general health of 
the mammal), the mode of administration, and the type of formulation. In general, a 
therapeutically or prophylactically effective dose from about 1 \ig to about 1 mg, preferably, 
from about 10 \xg to about 800 jag and, more preferably, from about 25 \ig to about 250 fig, can 
be administered to human adults. The administration can be achieved in a single dose or 
repeated at intervals. 

The route of administration can be any conventional route used in the vaccine field. 
As general guidance, a polynucleotide of the invention can be administered via a mucosal 
surface, e.g., an ocular, intranasal, pulmonary, oral, intestinal, rectal, vaginal, and urinary tract 
surface; or via a parenteral route, e.g., by an intravenous, subcutaneous, intraperitoneal, 

.intrademial, Jntraepideimal, orJ^ will 
depend on, e.g., the formulation that is selected. A polynucleotide formulated in association with 
bupivacaine is advantageously administered into muscles. When a neutral or anionic liposome or 

-a cationic lipid, sueh - as-BOTMA or DOChol, is used y- tb e=fer mulation can-be-advantageously— 



M) injected via intravenous, intranasal (aerosolization), intramuscular, intradermal, and 
v3 subcutaneous routes. A polynucleotide in a naked form can advantageously be administered via 

ii 

the intramuscular, intradermal, or sub-cutaneous routes. 

Although not absolutely required, such a composition can also contain an adjuvant. If 
so, a systemic adjuvant that does not require concomitant administration in order to exhibit an 
25 adjuvant effect is preferable such as, e.g., QS21, which is described in U.S. Patent No. 5,057,546. 

The sequence information provided in the present application enables the design of 
specific nucleotide probes and primers that can be useful in diagnosis. Accordingly, in a fifth 
aspect of the invention, there is provided a nucleotide probe or primer having a sequence found 
in or derived by degeneracy of the genetic code from a sequence shown in SEQ ID NO:l to 2. 
30 The term "probe" as used in the present application refers to DNA (preferably single 

stranded) or RNA molecules (or modifications or combinations thereof) that hybridize under the 
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stringent conditions, as defined above, to nucleic acid molecules having sequences homologous 
to those shown in SEQ ID NOsrl and 2, or to a complementary or anti-sense sequence. 
Generally, probes are significantly shorter than full-length sequences shown in SEQ ID NOs: 1 
and 2; for example, they can contain from about 5 to about 100, preferably from about 10 to 
about 80 nucleotides. In particular, probes- have sequences thatcare::at least;; ? 75%, preferably at 
least 85%, more preferably 95% homologous to a portion of a sequence asoshown-in SEQ ID 
NOs:l and 2 or that are complementary to such sequences. Probes can contain modified bases 
such as inosine, methyl- 5 -deoxycytidine, deoxyuridine, dimethylamino-5-deoxyuridine, or 
diamino-2, 6-purine. Sugar or phosphate residues can also be modified or substituted. For 
example, a deoxyribose residue can be replaced by a polyamide (Nielsen et al. 9 Science (1991) 
254:1497) and phosphate residues can be replaced by ester groups such as diphosphate, alkyl, 
arylphosphonate and phosphorothioate esters. In addition, the 2'-hydroxyl group on 
ribonucleotides can be modified by including, e.g., alkyl groups. 

Probes of the invention can be used in diagnostic tests, as capture or detection probes. 
Such capture probes can be^conventionally immobilize#on~a solid support£ldireetly or indirectly, 
by covalent means or by passive adsorption. A detection probe can be labelled^by a detection 
marker selected from radioactive isotopes; enzymes such as peroxidase,; alkaline phosphatase, 
and enzymes able to hydrolyze a chromogenic, fluorogenic;40r:luminescent substrate; compounds 
^thatarexhrcmrogenicrfi^ ancHbiotin. = 

Probes of the^invention^can^be^used*^ such 
as dot blot (Maniatise/ a/., Molecular Cloning • A Laboratory Manual (1982) Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, New York), Southern blot (Southern, J. Moh Biol. (1975) 
98:503), northern blot (identical to Southern blot to the exception that RNA is used as a target), 
or the sandwich technique (Dunn et aL, Cell (1 977) 12:23). The latter technique involves the use 
of a specific capture probe and/or a specific detection probe with nucleotide sequences that at 
least partially differ from each other. 

A primer is usually a probe of about 10 to about 40 nucleotides that is used to initiate 
enzymatic polymerization^pf .DNA in an .ampHfiea^ 

process, or in a reverse^transcriptionvmethod. In a diag^stfe'm primers 
can be labelled. 
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Thus, the invention also encompasses (i) a reagent containing a probe of the invention 
for detecting and/or identifying the presence of Chlamydia in a biological material; (ii) a method 
for detecting and/or identifying the presence of Chlamydia in a biological material, in which (a) a 
sample is recovered or derived from the biological material, (b) DNA or RNA is extracted from 
5 the material and denatured, and (c) exposed to a probe of the invention, for example, a capture, 
detection probe or both, under stringent hybridization conditions, such that hybridization is 
detected; and (iii) a method for detecting and/or identifying the presence of Chlamydia in a 
biological material, in which (a) a sample is recovered or derived from the biological material, 
t, (b) DNA is extracted therefrom, (c) the extracted DNA is primed with at least one, and preferably 
10 two, primers of the invention and amplified by polymerase chain reaction, and (d) the amplified 
DNA fragment is produced. 

As previously mentioned, polypeptides that can be produced upon expression of the 
newly identified open reading frames are useful vaccine agents. 
£3 Therefore, a sixth aspect of the invention features a substantially purified polypeptide 
I fi5 or polypeptide derivative having an amino acid sequence encoded by a polynucleotide of the 
r= invention. _ . . . 

L- 1 ■ - - .... 

n A "substantially purified polypeptide" is defined as a polypeptide that is separated 

= from the environment in which it naturally occurs and/or that is free of the majority of the 

-L p a l ypeptides th at are p resent4n^th^ nyironment in which Jt was synthesized. For examples, 

zip substantially purified polypeptide is free from cytoplasmic polypeptides. Those skilled in the art 

F J 

V3 will understand that the polypeptides of the invention can be purified from a natural source, i.e. , a 
Chlamydia strain, or can be produced by recombinant means. 

Homologous polypeptides or polypeptide derivatives encoded by polynucleotides of 
the invention can be screened for specific antigenicity by testing cross-reactivity with an 

25 antiserum raised against the polypeptide of reference having an amino acid sequence as shown in 
SEQ ID NOs:3 to 4 . Briefly, a monospecific hyperimmune antiserum can be raised against a 
purified reference polypeptide as such or as a fusion polypeptide, for example, an expression 
product of MBP, GST, or His-tag systems or a synthetic peptide predicted to be antigenic. The 
homologous polypeptide or derivative screened for specific antigenicity can be produced as such 

30 or as a fusion polypeptide. In this latter case and if the antiserum is also raised against a fusion 
polypeptide, two different fusion systems are employed. Specific antigenicity can be determined 
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according to a number of methods, including Western blot (Towbin et al., Proc. Natl. Acad, Sci. 
USA (1979) 76:4350), dot blot, and ELISA, as described below. 

In a Western blot assay, the product to be screened, either as a purified preparation or 
a total E. coli extract, is submitted to SDS-Page electrophoresis as described by Laemmli (Nature 
(1970) 227:680)^ AfterJiransfe^^ further incubated 

with the monospejeific^yperimmune^ 1*5 to 

about 1 iSOOO^preferably^from^abouM : 1 00 4 to about 1 :500MrSpecifieiantigenici^is shown 1 once a 
band corresponding^©- the*produet*exhib^^ anyiof ^theYdilutions*inftthe*above range. 

In an ELISA assay, the product to be screened is preferably used as the coating 
antigen. A purified preparation is preferred, although a whole cell extract can also be used. 
Briefly, about 100 \xl of a preparation at about 10 \ig protein/ml are distributed into wells of a 96- 
well polycarbonate ELISA plate. The plate is incubated for 2 hours at 37°C then overnight at 
4°C. The plate is washed with phosphate buffer saline (PBS) containing 0.05% Tween 20 
(PBS/Tween buffer). The wells are saturated with 250 jil PBS containing 1% bovine serum 
albumin (BSA>to prevent ♦noh-specifie^tibody, bindings 37°C, the 

plate is -washed with PBS J Mween$buff^ buffer 
containing i0.5%> BSA. 100^1 of ^dilutions «are^added*P£r^ ^incubated for 

90 minutes *at ^3 7°Q washedfcand^valuated^^ a 



in rabbits. IncubationHSxan?ied«oi^ reaction 
is developed— witb the%appropmate^ reaction is measured by colorimetry 

(absorbance measured spectrophotometrically). Under the above experimental conditions, a 
positive reaction is shown by O.D. values greater than a non immune control serum. 

In a dot blot assay, a purified product is preferred, although a whole cell extract can 
also be used. Briefly, a solution of the product at about 1 00 ng/ml is serially two-fold diluted in 
50 mM Tris-HCl (pH 7.5). 100 y\ of each dilution are applied to a nitrocellulose membrane 0.45 
\xm set in a 96-well dot blot apparatus (Biorad). The buffer is removed by applying vacuum to 
the system. Wells are washed by addition ofcS 0' mM jJrissHGL(pIJ ,7?5) and^the^memferane. is air- 
dried. The membrane*is saturated*in bldekirtg^ 

1 0 g/L skim milk) andwncubatedovith ^antisei^i^ 50!5tb>abciuKl: 5 000, 

preferably about 1 :500r Th^reaction^ 
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a goat anti-rabbit peroxidase conjugate is added to the wells when rabbit antibodies are used. 
Incubation is carried out 90 minutes at 37°C and the blot is washed. The reaction is developed 
with the appropriate substrate and stopped. The reaction is measured visually by the appearance 
of a colored spot, e.g., by colorimetry. Under the above experimental conditions, a positive 
reaction is shown once a colored spot is associated with a dilution of at least about 1:5, 
preferably of at least about 1 :500. 

Therapeutic or prophylactic efficacy of a polypeptide or derivative of the invention 
can be evaluated as described below. 

According to a seventh aspect of the invention, there is provided (i) a composition of 
matter containing a polypeptide of the invention together with a diluent or carrier; in particular, 
(ii) a pharmaceutical composition containing a therapeutically or prophylactically effective 
amount of a polypeptide of the invention; (iii) a method for inducing an immune response against 
Chlamydia in a mammal, by administering to the mammal an immunogenically effective amount 
of a polypeptide of the invention to elicit an immune response, e.g., a protective immune 
response to Chlamydia; and particularly, (iv) a method for preventing and/or treating a 
Chlamydia (e.g., C. trachomatis, C psittaci, C. pneumoniae, or C. pecorum) infection, by 
administering a prophylactic or therapeutic amount of a polypeptide of the invention to an 
individual in need. Additionally, the seventh aspect of the invention encompasses the use of a 
4^Qlypeptide_QOheJnvention in the preparation of a medicament for preventing and/or treating— 



go Chlamydia infection. 

The immunogenic compositions of the invention can be administered by any 

0^ conventional route in use in the vaccine field, in particular to a mucosal (e.g., ocular, intranasal, 
pulmonary, oral, gastric, intestinal, rectal, vaginal, or urinary tract) surface or via the parenteral 
(e.g., subcutaneous, intradermal, intramuscular, intravenous, or intraperitoneal) route. The 

25 choice of the administration route depends upon a number of parameters, such as the adjuvant 
associated with the polypeptide. For example, if a mucosal adjuvant is used, the intranasal or 
oral route will be preferred and if a lipid formulation or an aluminum compound is used, the 
parenteral route will be preferred. In the latter case, the sub-cutaneous or intramuscular route is 
most preferred. The choice can also depend upon the nature of the vaccine agent. For example, 

30 a polypeptide of the invention fused to CTB or LTB will be best administered to a mucosal 
surface. 
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A composition of the invention can contain one or several polypeptides or derivatives 
of the invention. It can also contain at least one additional Chlamydia antigen, or a subunit, 
fragment, homolog, mutant, or derivative thereof. 

For use in a composition of the invention, a polypeptide or derivative thereof can be 
5 formulated into, »or~ withw JfyQf^ ^iomefcJipQ^ 

ISCOMS/ or ^vinis-lik^spacliGles«'-(V£'Bs) w to faeilitate^deliveiy and/of*^enhanee r the^immune; ^ 
y ^ response. These^epmpounds are^readily available to one skilled in the* art; for example, see ^ 
Liposomes: A PraLGti^»Approaeh»<(ywpm)^ 

Adjuvants other than liposomes and the like can also be used and are known in the 
10 art. A appropriate selection can conventionally be made by those skilled in the art, for example, 
from the list provided below. 

Administration can be achieved in a single dose or repeated as necessary at intervals 
as can be determined by one skilled in the art. For example, a priming dose can be followed by 
three booster doses at weekly or monthly intervals. An appropriate dose depends on various 
15 parameters including4their.pcipiientt(ei ) g4V;adul^or infant»)^he^partiGular«Maccine ainigen v the route 
- and frequencyiof administratiQn^the«presenqe/absenee or type«of adju^ant^and the^desired effect 
rj (e*g-> protection ^d^o^treatment^as can>be^deteimined*by4oneiskilled*inithe^art'!^feIn general, a 
^ vaccine antigen^ o&theiinventiontean^ from about 

j^J 1 0 Mg to about 400 nig,i,picfeiaM 

fflo administration? the doseiusuallyishould^ 

CO 

I" When*usedtas vaeeine*agents^poliyn^^^ can 

~?2 be used sequentially as part of a multistep immunization process. For example, a mammal can 

£5 be initially primed with a vaccine vector of the invention such as a pox virus, e.g., via the 

ry 

y g parenteral route, and then boosted twice with the polypeptide encoded by the vaccine vector, e.g., 
4*5 via the mucosal route. In another example, liposomes associated with a polypeptide or derivative 
of the invention can also be used for priming, with boosting being carried out mucosally using a 
soluble polypeptide or derivative of the invention in combination with a mucosal adjuvant (e.g., 
LT). 

A polypeptid^derivativefcof th^^ 
30 detecting thepresenfcesof anti£<?M^ 

are about 5 to about 80, Mefe^ labeled 
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or unlabeled, depending upon the diagnostic method. Diagnostic methods involving such a 
reagent are described below. 

Upon expression of a DNA molecule of the invention, a polypeptide or polypeptide 
derivative is produced and can be purified using known laboratory techniques. For example, the 
polypeptide or polypeptide derivative can be produced as a fusion protein containing a fused tail 
that facilitates purification. The fusion product can be used to immunize a small mammal, e.g., a 
mouse or a rabbit, in order to raise antibodies against the polypeptide or polypeptide derivative 
(monospecific antibodies). The eighth aspect of the invention thus provides a monospecific 
antibody that binds to a polypeptide or polypeptide derivative of the invention. 

By "monospecific antibody" is meant an antibody that is capable of reacting with a 
unique naturally-occurring Chlamydia polypeptide. An antibody of the invention can be 
polyclonal or monoclonal. Monospecific antibodies can be recombinant, e.g., chimeric {e.g., 
constituted by a variable region of murine origin associated with a human constant region), 
humanized (a human immunoglobulin constant backbone together with hypervariable region of 
animal, e.g., murine, origin), and/or single chain. Both polyclonal and monospecific antibodies 
can also be, in the form of immunoglobulin, fragments, e.g., F(ab) ? 2.or Fab fragments. The 
antibodies of the invention can be of any isotype, e.g., IgG or IgA, and polyclonal antibodies can 
be of a single isotype or can contain a mixture of isotypes. 

The -a ntibodi e s-of th e invention , which are rais e d to a polypeptid e cu^-polypeptide- 



derivative of the invention, can be produced and identified using standard immunological assays, 
e.g., Western blot analysis, dot blot assay, or ELISA (see, e.g., Coligan et aL, Current Protocols 
in Immunology (1994) John Wiley & Sons, Inc., New York, NY). The antibodies can be used in 
diagnostic methods to detect the presence of a Chlamydia antigen in a sample, such as a 
biological sample. The antibodies can also be used in affinity chromatography methods for 
purifying a polypeptide or polypeptide derivative of the invention. As is discussed further 
below, such antibodies can be used in prophylactic and therapeutic passive immunization 
methods. 

Accordingly, a ninth aspect of the invention provides (i) a reagent for detecting the 
presence of Chlamydia in a biological sample that contains an antibody, polypeptide, or 
polypeptide derivative of the invention; and (ii) a diagnostic method for detecting the presence of 
Chlamydia in a biological sample, by contacting the biological sample with an antibody, a 
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polypeptide, or a polypeptide derivative of the invention, such that an immune complex is 
formed, and by detecting such complex to indicate the presence of Chlamydia in the sample or 
the organism from which the sample is derived. 

Those skilled in the art will understand that the immune complex is formed between a 
5 component of the^sample»an^ i s 
used, and that^any * unbound*material can be removed- prior*to detecting the*complex^ As *can be 
easily understood?^ polypeptide reagent is useful for detecting the. presence^ of anti-iShlamydia 
antibodies in a sample, e.g., a blood sample^ while an antibody of the invention A pan be used for 
screening a sample, such as a gastric extract or biopsy, for the presence of Chlamydia 
10 polypeptides. 

For use in diagnostic applications, the reagent (i.e., the antibody, polypeptide, or 
polypeptide derivative of the invention) can be in a free state or immobilized on a solid support, 
such as a tube, a bead, or any other conventional support used in the field. Immobilization can 

□ be achieved using direct or indirect means. Direct means include passive adsorption (non- 

g|5 covalent binding) or covalent4>itfding be^ 

tjj is meant that an anti-reagent compound that interacts <with«a reagent is first attached 4p the solid 
t| support. For exampl^if a poIy ( pep|kLe^ it can serve as 

s an anti-reagent, provided thatnit>binds%to an^epitop^*that^is*npt^invoLv^.4n the^recognition of 
antibodies-jnr4?iological^samples^4fidirect mcans^can^sor^n^cry-a ligand r receptor system^for- 
^0 example, a molecule^such$&%^vit^ 

tf3 corresponding receptor can be immobilized on the solid phase. This is illustrated by 4he biotin- 
streptavidin system. Alternatively, indirect means can be used, e.g., by adding to the reagent a 
peptide tail, chemically or by genetic engineering, and immobilizing the grafted or fused product 
by passive adsorption or covalent linkage of the peptide tail. 

25 According to a tenth aspect of the invention, there is provided a process for purifying, 

from a biological sample, a polypeptide or polypeptide derivative of the invention, which 
involves carrying out antibody-based affinity chromatography with the biological sample, 
wherein the antibody is a monospeeific-antibody of thennventions** 

For use in a purification process of the invention, the antibday&can^ 

30 monospecific, and preferably is <of the^gG^type,^ 

antiserum using standard metlfods (see, e.g.; Cdligan et aL 9 supra) ^ Conventional 
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chromatography supports, as well as standard methods for grafting antibodies, are disclosed in, 
e.g., Antibodies: A Laboratory Manual, D. Lane, E. Harlow, Eds, (1988). 

Briefly, a biological sample, such as an C. pneumoniae extract, preferably in a buffer 
solution, is applied to a chromatography material, preferably equilibrated with the buffer used to 
5 dilute the biological sample so that the polypeptide or polypeptide derivative of the invention 
(i.e., the antigen) is allowed to adsorb onto the material. The chromatography material, such as a 
gel or a resin coupled to an antibody of the invention, can be in batch form or in a column. The 
unbound components are washed off and the antigen is then eluted with an appropriate elution 
buffer, such as a glycine buffer or a buffer containing a chaotropic agent, e.g., guanidine HC1, or 
10 high salt concentration (e.g., 3 M MgCl 2 ). Eluted fractions are recovered and the presence of the 
antigen is detected, e.g., by measuring the absorbance at 280 run. 

An antibody of the invention can be screened for therapeutic efficacy as described as 
^ follows. According to an eleventh aspect of the invention, there is provided (i) a composition of 
C3 matter containing a monospecific antibody of the invention, together with a diluent or carrier; (ii) 
h|5 a pharmaceutical composition containing a therapeutically or prophylactically effective amount 
^ of.a monospecific antibody of the invention, and (iii) a method for treating or preventing a 
6? Chlamydia (e.g., C. trachomatis, C. psittaci, C. pneumoniae or C. pecorum) infection, by 
5 administering a therapeutic or prophylactic amount of a monospecific antibody of the invention 
Ll to an individual in need. Additionally , - therefevent h- aspect of the invention e ncompa s ses~th e^use^ 
|^0 of a monospecific antibody of the invention in the preparation of a medicament for treating or 
preventing Chlamydia infection. 

To this end, the monospecific antibody can be polyclonal or monoclonal, preferably 
of the IgA isotype (predominantly). In passive immunization, the antibody can be administered 
to a mucosal surface of a mammal, e.g., the gastric mucosa, e.g., orally or intragastrically, 
25 advantageously, in the presence of a bicarbonate buffer. Alternatively, systemic administration, 
not requiring a bicarbonate buffer, can be carried out. A monospecific antibody of the invention 
can be administered as a single active component or as a mixture with at least one monospecific 
antibody specific for a different Chlamydia polypeptide. The amount of antibody and the 
particular regimen used can be readily determined by one skilled in the art. For example, daily 
30 administration of about 100 to 1,000 mg of antibodies over one week, or three doses per day of 
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about 100 to 1,000 mg of antibodies over two or three days, can be an effective regimens for 
most purposes. 

Therapeutic or prophylactic efficacy can be evaluated using standard methods in the 
art, e g*, by measuring induction of a mucosal immune response or induction of protective and/or 
therapeutic immimity^using,^., the»Gk pneumoma^*mo,ii^imodel^ ThosetsMledrin the art will 
recognize that*the*C pneumoniae ^strain of the model can4>eMreplaced*with^^ 
strain. For example, the*effieacy of DNA molecules andrpolypeptides^from*G.^/?/2ewm<?/7/ae is 
preferably evaluated m?a* mouse^model^using^an-^C pneumoniae - strain?* Prbtejption can be 
determined by comparing the degree of Chlamydia infection to that of a control group. 
Protection is shown when infection is reduced by comparison to the control group. Such an 
evaluation can be made for polynucleotides, vaccine vectors, polypeptides and derivatives 
thereof, as well as antibodies of the invention. 

Adjuvants useful in any of the vaccine compositions described above are as follows. 

Adjuvants for parenteral administration include aluminum compounds, such as 
aluminum hydroxide^ata can 
be precipitated with j or *adsoibe.d*onto, the aluminum compound^ accordingito^standard protocols. 
Other adjuvants, such as RIBI^(ImmunoChem, Hamilton* MT), can^be*tused*dn parenteral 
administration. 

Adju\antsformucosabadtoinistratfo^^ 



(CT), the E. ca/f4ieatUabiIe*t©3an*^^ toxin 
(PT), or combinations^spbumtsqttpxoids^or mutantejthe^eig^ purified preparation 

of native cholera toxin subunit B (CTB) can be of use. Fragments, homologs, derivatives, and 
fusions to any of these toxins are also suitable, provided that they retain adjuvant activity. 
Preferably, a mutant having reduced toxicity is used. Suitable mutants are described, e.g., in WO 
95/17211 (Arg-7-Lys CT mutant), WO 96/6627 (Arg-192-Gly LT mutant), and WO 95/34323 
(Arg-9-Lys and Glu-1 29-Gly PT mutant). Additional LT mutants that can be used in the 
methods and compositions of the invention include, e.g., Ser-63-Lys, Ala-69-Gly, Glu-1 10-Asp, 
and Glu-1 12- Asp mutants.^ Other *adju^^ A 
(MPLA) of, e.g., E.*eoli«*Sdlmonella*minne$otai&& 

saponins, or Tolylac^ mucosal 
administration:^ 
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Adjuvants useful for both mucosal and parenteral administrations include 
polyphosphazene (WO 95/2415), DC-chol (3 b-(N-(N\N'-dimethyl aminomethane)-carbamoyl) 
cholesterol; U.S. Patent No. 5,283,1 85 and WO 96/1483 1) and QS-21 (WO 88/9336). 

Any pharmaceutical composition of the invention, containing a polynucleotide, a 
polypeptide, a polypeptide derivative, or an antibody of the invention, can be manufactured in a 
conventional manner. In particular, it can be formulated with a pharmaceutically acceptable 
diluent or carrier, e.g., water or a saline solution such as phosphate buffer saline. In general, a 
diluent or carrier can be selected on the basis of the mode and route of administration, and 
standard pharmaceutical practice. Suitable pharmaceutical carriers or diluents, as well as 
pharmaceutical necessities for their use in pharmaceutical formulations, are described in 
Remington's Pharmaceutical Sciences, a standard reference text in this field and in the USP/NF. 

The invention also includes methods in which Chlamydia infection, are treated by 
oral administration of a Chlamydia polypeptide of the invention and a mucosal adjuvant, in 
combination with an antibiotic, an antacid, sucralfate, or a combination thereof. Examples of 
such compounds that can be administered with the vaccine antigen and the adjuvant are 
antibiotics, including, e.g., macrolides, tetracyclines, and derivatives thereof (specific examples 
of antibiotics that can be used include azithromycin or doxicyclin or immunomodulators such as 
cytokines or steroids. In addition, compounds containing more than one of the above-listed 
-compone nts c oupl e d t o ge ther, c an be used. The invention also includes compositio n s^^ 
carrying out these methods, i.e., compositions containing a Chlamydia antigen (or antigens) of 
the invention, an adjuvant, and one or more of the above-listed compounds, in a 
pharmaceutically acceptable carrier or diluent. 

Amounts of the above-listed compounds used in the methods and compositions of the 
invention can readily be determined by one skilled in the art. In addition, one skilled in the art 
can readily design treatment/immunization schedules. For example, the non-vaccine components 
can be administered on days 1-14, and the vaccine antigen + adjuvant can be administered on 
days 7, 14, 21, and 28. 
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cactgtggat gtgatattcg cagaacctcc cgtcaaatat actctagata taggaagcaa 60 

attacgattt taaaccttat ttaacgacag ggttgaggc atg cct ctt tct ttc 114 

Met Pro Leu Ser Phe 
1 5 



aaa tct tea -tct*;tfctee*tgt* eta ctt- ge£,?tgt<? jtta tgte :agt -gcav^g^tgejK* 
Lys Ser Ser^Ser^Phe^Cys ^Leu«Leu^Al>a» Cys -Leu iGys Ser Al a . S e s^Gy'sjH*. 
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gcg ttt get . gag* act a gamete ggavggg^aac ttt gtt cct cca + at-t acg 210 
Ala Phe Ala Glu Th'r«*Ar'g Leu*<Gl$TvGly<Asn< Phe; Val Pror-Pro-* Ile .Thx^ 
25 30 35 

aat cag ggt gaa gag ate tta etc act tea gat ttt gtt tgt tea aac 258 
Asn Gin Gly Glu Glu lie Leu Leu Thr Ser Asp Phe Val Cys Ser Asn 
40 45 50 



ttc ttg ggg gcg agt ttt tea agt tec ttt ate aat agt tec age aat 
Phe Leu Gly Ala Ser Phe Ser Ser Ser Phe lie Asn Ser Ser Ser Asn 
55 60 65 
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etc tec tta tta ggg aag ggc ctt tec tta acg ttt ace tct tgt caa 354 
Leu Ser Leu Leu Gly Lys Gly Leu Ser Leu Thr Phe Thr Ser Cys Gin 
70 75 80 85 



get cct aca aat agt^.aae^tat>gcg* eta ctt tct gcc-gca<Mgag*act. etgu.. 
Ala Pro Thr Asn Ser^Asn- Tyr Ala Leu Leu, Ser- Ala Ala Glu Thr Leu*. 

90 > 95 100 , 
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2 acc ttc aag aat tttotct tct ^ata^ aac^t^ac^^jg^e^aa^g^aca^. 450 

^ Thr Phe Lys Asn Phe .Ser ver *Tle AsnrfPhfe>Thr4K31y^sn*^Gln^ 
= 105 110*:' 115 .^v 

tj, gg a ctt gyc ggo *ctc~ a-ge=. tacwgga aaa.-gat, att^.gtt tLc~caa tct \atc^, 4 9% 

Gly Leu Gly Gly Leu-^Ile Tyr-*Gly Lys-*Asp«irIle *Val Phe ^Gln Ser lie 
51 120 125 * 130 - 
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Leu 
265 


gga 
Gly 
Gly 


agt 
Ser 
Ser 


ggc 
Gly 
Gly 


gga 
Gly 
Gly 


gec 
Ala 
Ala 
270 


ate 
He 
He 


tgc 
Cys 
Cys 


ate 
He 
He 


cct 
Pro 
Pro 


acg 
Thr 
Thr 
275 


gga 
Gly 
Gly 


act 
Thr 
Thr 


930 


ttc 
Phe 
Phe 


gaa 
Glu 
Glu 


tta 
Leu 
Leu 
280 


aaa 
Lys 
Lys 


aac 
Asn 
Asn 


aat 
Asn 
Asn 


cag 
Gin 
Gin 


ggg 

Gly 
Gly 
285 


aag 
Lys 
Lys 


tgc 
Cys 
Cys 


acc 
Thr 
Thr 


ttc 
Phe 
Phe 


tct 
Ser 
Ser 
290 


tat 
Tyr 
Tyr 


aat 
Asn 
Asn 


ggt 
Gly 
Gly 


978 


y ^ 
Q 


aca 
Thr 
Thr 


cca 
Pro 
Pro 
295 


aat 
Asn 
Asn 


gat 
Asp 
Asp 


gcg 
Ala 
Ala 


ggt 

G-Ly 
Gly 


gcg 

Ala 

Ala 
300 


ate 

Tie 

He 


tac 
Tyr 
Tyr 


gec 
Ala 


gaa 

VjJ- LI 

Glu 


acc 
Thr 
Thr 
305 


tgc 
Cys 
Cys 


aac 
Asn 
Asn 


ate 
He 
He 


gta 
Val 
Val 


1026 


r ~ 

£□ 
no 


ggg 

Gly 
Gly 
310 


aac 
Asn 
Asn 


cag 
Gin 
Gin 


ggt 
Gly 
Gly 


gec 
Ala 
Ala 


ttg 
Leu 
Leu 
315 


etc 
Leu 
Leu 


eta 
Leu 
Leu 


gat 
Asp 
Asp 


age 
Ser 
Ser 


aac 
Asn 
Asn 
320 


act 
Thr 
Thr 


gca 
Ala 
Ala 


gcg 
Ala 
Ala 


aga 
Arg 
Arg 


aat 
Asn 
Asn 
325 


1074 




ggc 


gga 


gee 


ate 


tgt 


get 


aaa 


gtg 

=Vs4= 


etc 
Leu 


aat 

=As*p 


att 


caa 

-GXn- 


gga 


cgc 
-Arg- 


ggt 


cct 
Pro 


1122 


fO 


— G±y 
Gly 


Gly 


-A-l-a- 
Ala 


He 


~Cys^ 
Cys 
330 


=A£xF 
Ala 


Lys 
Lys 


Val 


Leu 


Asn 
335 


He 


Gln 


Gly 


Arg 


Gly 
340 


Pro 






att 
lie 
lie 


gaa 
Glu 
Glu 


ttc 
Phe 
Phe 


tct 
Ser 
Ser 
345 


aga 
Arg 
Arg 


aac 
Asn 
Asn 


cgc 
Arg 
Arg 


gcg 
Ala 
Ala 


gag 
Glu 
Glu 
350 


aag 
Lys 
Lys 


ggt 
Gly 
Gly 


gga 
Gly 
Gly 


get 
Ala 
Ala 


att 
lie 
lie 
355 


ttc 
Phe 
Phe 


at a 
He 
He 


1170 




ggc 
Gly 
Gly 


ccc 
Pro 
Pro 


tct 
Ser 
Ser 
360 


gtt 
Val 
Val 


gga 
Gly 
Gly 


gac 
Asp 
Asp 


cct 
Pro 
Pro 


gcg 
Ala 
Ala 
365 


aag 
Lys 
Lys 


caa 
Gin 
Gin 


aca 
Thr 
Thr 


teg 
Ser 
Ser 


aca 
Thr 
Thr 
370 


ctt 
Leu 
Leu 


acg 
Thr 
Thr 


att 

He 
He 


1218 




ttg 
Leu 
Leu 


get 
Ala 
Ala 
37 5 


tec 
Ser 
Ser 


gaa 
Glu 
Glu 


ggt 
Gly 
Gly 


gat 
Asp 
Asp 


att 
He 
He 
380 


gcg 
Ala 
Ala 


ttc 
Phe 
Phe 


caa 
Gin 
Gin 


gga 
Gly 
Gly 


aac 
Asn 
Asn 
385 


atg 
Met 
Met 


etc 
Leu 
Leu 


aat 
Asn 
Asn 


aca 
Thr 
Thr 


1266 




aaa 
Lys 
Lys 
390 


cct 
Pro 
Pro 


gga 
Gly 
Gly 


ate 
He 
He 


cgc 
Arg 
Arg 


aat 
Asn 
Asn 
395 


gec 
Ala 
Ala 


ate 
He 
He 


act 
Thr 
Thr 


gta 
Val 
Val 


gaa 
Glu 
Glu 
400 


gca 
Ala 
Ala 


ggg 

Gly 
Gly 


gga 
Gly 
Gly 


gag 
Glu 
Glu 


att 
He 
He 
405 


1314 




gtg 
Val 
Val 


tct 
Ser 
Ser 


eta 
Leu 
Leu 


tct 
Ser 
Ser 


gca 
Ala 
Ala 


eaa 
Gin 
Gin 


gga 
Gly 
Gly 


ggc 
Gly 
Gly 


tea 
Ser 
Ser 


cgt 
Arg 
Arg 


ctt 
Leu 
Leu 


gta 
Val 
Val 


ttt 
Phe 
Phe 


tat 
Tyr 
Tyr 


gat 
Asp 
Asp 


ccc 
Pro 
Pro 


1362 
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410 



415 



420 



att aca cat age etc cca acc aca agt ccg tct aat aaa gac att aca 
He Thr His Ser Leu Pro Thr Thr Ser Pro Ser Asn Lys Asp He Thr 
He Thr His Ser Leu Pro Thr Thr Ser Pro Ser Asn Lys Asp He Thr 
425 430 435 

ate aac get aat ggc get tea gga tct gta gtc ttt .aca agt- aag gga 

He Asn Ala Asn 'Gly Ala Ser Gly Ser Val Val Phe Thr Ser Lys Gly 
He Asn Ala AsnttGly* Ala^Sec^^ 

440 445" 4 50* 

etc tec tct aca gaa«- etc- ctg* ttg cct gec aac acg aca act ata ctt 

Leu .Ser Ser Thr^Glu Leu^Le^Leu .Pro Ala Asn Thr-* Thr- Thr^Ile Leu - 

Leu Ser Ser Tttr*Glu. Leu^Leui*Leu>:Pro~.Ala Asn^Thr^Thr -Thr- lie Leu 
455 4 60' 4 65** 

eta gga aca gtc aag ate get agt gga gaa ctg aag att act gac aat 
Leu Gly Thr Val Lys He Ala Ser Gly Glu Leu Lys He Thr Asp Asn 
Leu Glv Thr Val Lys He Ala Ser Gly Glu Leu Lys He Thr Asp Asn 
470 475 480 485 

acq gtt gtc aat gtt get ggc ttc get act cag ggc tea ggt cag ctt 
Ala Val Val Asn Val Ala Gly Phe Ala Thr Gin Gly Ser Gly Gin Leu 
Ala Val Val Asn Val Ala Gly Phe Ala Thr Gin Gly Ser Gly Gin Leu 
490 495 500 

acc ctg ggc tct gga gga acc tta ggg ctg gca aca ccc acg gga gca 
Thr Leu Gly Ser Gly Gly Thr Leu Gly Leu Ala Thr Pro Thr Gly Ala 
Thr Leu Gly Se^Gly^Ga^ 

50S£* 510- 515- * 

cct gec get gtst gae^tfct acg^at^ gga-aag^ta v gca^tc , ga^^c^ ttitv 

Pro Ala Ala Val<yAsp Phe^Thr^He. Gly^Ly^Leu^Ala Pfte*&spj&?ofrPKe*« . 

Pro Ala Ala Val* AsWJle**^ 

520 525*** 5 30* 



1410 



1458 



1506 



1554 



1602 



1650 



1698 



tec ttc eta aaa ^ aaa^ga v Wt€ 3 ^^ 
Ser Phe heu -hys^hm^ 
Ser Phe Leu- -iy^ArVgl^ 

535 54'0*£ S^S*** 

aaa aac gtc act tta aca gga get ctg gtt ctt gat gaa cat gac gtt 
Lys Asn Val Thr Leu Thr Gly Ala Leu Val Leu Asp Glu His Asp Val 
Lvs Asn Val Thr Leu Thr Gly Ala Leu Val Leu Asp Glu His Asp Val 
550 555 560 565 

aca gat ctt tat gat atg gtg tea tta caa tct cca gta gca att cct 
Thr Asp Leu Tyr Asp Met Val Ser Leu Gin Ser Pro Val Ala He Pro 
Thr Asp Leu Tyr Asp Met Val Ser Leu Gin Ser Pro Val Ala He Pro 
570 575 580 

ate get gtt ttc aaa gga gca acc gtt act aag aca gga ttt cct gat 

He Ala Val Phe Lys Gly Ala Thr Val Thr Lys Thr Gly Phe Pro Asp 

He Ala Val Phe Lys Gly Ala Thr Val Thr Lys Thr Gly Phe Pro Asp 

585 590 595 

qgq gag att gcg^ac£#ccawage*jeae^^ 1938 

Gly Glu He Al^WRh^*Pr^Se^^ 

Gly Glu He Al<aB*£h^Prd1*S^ 

* — 60S«& 61S0IK& 



1794 



1842 



1890 



600. 



tac aca tgg tec cgt ccc etg^tt-a^att^^ 

Tyr Thr Trp Ser Arg Pro Leu Leu lie' Pro Ala Pro Asp Gly Gly Phe 

34 



1986 
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Tyr 


Thr 
615 


Trp 


Ser 


Arg 


Pro 


Leu 
620 


Leu 


He 


Pro 


Ala 


Pro 
625 


Asp 


Gly 


Gly 


Phe 






cct 
Pro 
Pro 
630 


gga 
Gly 
Gly 


ggt 
Gly 
Gly 


ccc 
Pro 
Pro 


tct 
Ser 
Ser 


cct 
Pro 
Pro 
635 


age 
Ser 
Ser 


gca 
Ala 
Ala 


aat 
Asn 
Asn 


act 
Thr 
Thr 


etc 
Leu 
Leu 
640 


tat 
Tyr 
Tyr 


get 
Ala 
Ala 


gta 
Val 
Val 


tgg 
Trp 
Trp 


aat 
Asn 
Asn 
645 


2034 




tea 
Ser 
Ser 


gac 
Asp 
Asp 


act 
Thr 
Thr 


etc 
Leu 
Leu 


gtg 
Val 
Val 
650 


cgt 
Arg 
Arg 


tct 
Ser 
Ser 


acc 
Thr 
Thr 


tat 
Tyr 
Tyr 


ate 
He 
He 
655 


tta 
Leu 
Leu 


gat 
Asp 
Asp 


ccc 
Pro 
Pro 


gag 
Glu 
Glu 


cgt 
Arg 
Arg 
660 


tac 
Tyr 
Tyr 


2082 




qqa 
Gly 
Gly 


gaa 
Glu 
Glu 


att 
He 
He 


gtc 
Val 
Val 
665 


age 
Ser 
Ser 


aac 
Asn 
Asn 


age 
Ser 
Ser 


tta 
Leu 
Leu 


tgg 
Trp 
Trp 
670 


att 
He 
He 


tec 
Ser 
Ser 


ttc 
Phe 
Phe 


tta 
Leu 
Leu 


gga 
Gly 
Gly 
675 


aat 
Asn 
Asn 


cag 
Gin 
Gin 


2130 




qca 
Ala 
Ala 


ttc 
Phe 
Phe 


tct 
Ser 
Ser 
680 


gat 
Asp 
Asp 


att 
He 
He 


etc 
Leu 
Leu 


caa 
Gin 
Gin 


gat 
Asp 
Asp 
685 


gtt 
val 
Val 


ctt 
Leu 
Leu 


ttg 
Leu 
Leu 


ata 
He 
He 


gat 
Asp 
Asp 
690 


cat 
His 
His 


ccc 
Pro 
Pro 


ggg 

Gly 
Gly 


2178 




tta 
Leu 
Leu 


tec 
Ser 
Ser 
695 


ata 
He 
He 


acc 
Thr 
Thr 


acq 
Ala 
Ala 


aaa 
Lys 
Lys 


get 
Ala 
Ala 
700 


tta 
Leu 
Leu 


gga 
Gly 
Gly 


gee 
Ala 
Ala 


tat 
Tyr 
Tyr 


gtc 

Val 
Val 
705 


gaa 
Glu 
Glu 


cac 
His 
His 


aca 
Thr 
Thr 


cca 
Pro 
Pro 


2226 


M 

• J= 

lj; 


aaa 
Arg 
Arg 
710 


caa 
Gin 
Gin 


gga 
Gly 
Gly 


cat 
His 
His 


gag 
Glu 
Glu 


ggc 
Gly 
Gly 
715 


ttt 
Phe 
Phe 


tea 
Ser 
Ser 


ggt 
Gly 
Gly 


cgc 
Arg 
Arg 


tat 
Tyr 
Tyr 
720 


gga 
Gly 
Gly 


ggc 
Gly 
Gly 


tac 
Tyr 
Tyr 


caa 
Gin 
Gin 


get 
Ala 
Ala 
725 


2274 


UJ 


gcg 
Ala 
Ala 


eta 
Leu 
Leu 


tct 
Ser 
Ser 


atg 
Met 
Met 


aac 
Asn 
Asn 
730 


tac 
Tyr 
Tyr 


a eg 
Thr 
Thr 


gac 
Asp 
Asp 


cac 
His 
His 


act 
Thr 
Thr 
735 


acg 
Thr 
Thr 


tta 
Leu 
Leu 


gga 
Gly 
Gly 


ctt 
Leu 
Leu 


tct 
Ser 
Ser 
740 


ttc 
Phe 
Phe 


2322 


iy 
rn 


999 
Gly 
Gly 


Gin 
Gin 


ctt 

Leu 
Leu 


tat 
Tyr 
Tyr 
745 


aaa 
Gly 
Gly 


aaa 
Lys 
Lys 


act 
Thr 
Thr 


aac 
Asn 
Asn 


gec 
Ala 
Ala 
750 


aac 
Asn 
Asn 


ccc 
Pro 
Pro 


tac 
Tyr 
Tyr 


gat 
Asp 
Asp 


tea 
Ser 
Ser 
755 


cgt 
Arg 
Arg 


tgc 
Cys 
Cys 


2370 




tea 
Ser 
Ser 


gaa 
Glu 
Glu 


caa 
Gin 
Gin 
760 


at q 
Met 
Met 


tat 
Tyr 
Tyr 


tta 
Leu 
Leu 


etc 
Leu 
Leu 


teg 
Ser 
Ser 
765 


ttc 
Phe 
Phe 


ttt 
Phe 
Phe 


ggt 
Gly 
Gly 


caa 
Gin 
Gin 


ttc 
Phe 
Phe 
770 


cct 
Pro 
Pro 


ate 
He 
He 


gtg 
Val 
Val 


2418 




act 
Thr 
Thr 


eaa 
Gin 
Gin 
775 


aag 
Lys 
Lys 


age 
Ser 
Ser 


Glu 
Glu 


qcc 
Ala 
Ala 


tta 
Leu 
Leu 
780 


att 
He 
He 


tec 
Ser 
Ser 


tgg 
Trp 
Trp 


aaa 
Lys 
Lys 


gca 
Ala 
Ala 
785 


get 
Ala 
Ala 


tat 
Tyr 
Tyr 


ggt 
Gly 
Gly 


tat 
Tyr 
Tyr 


2466 




tec 
Ser 
Ser 
790 


aaa 
Lys 
Lys 


aat 
Asn 
Asn 


cac 
His 
His 


eta 
Leu 
Leu 


aat 
Asn 
Asn 
795 


acc 
Thr 
Thr 


acc 
Thr 
Thr 


tac 
Tyr 
Tyr 


etc 
Leu 
Leu 


aga 
Arg 
Arg 
800 


cct 
Pro 
Pro 


gac 
Asp 
Asp 


aaa 
Lys 
Lys 


act 
Ala 
Ala 


cca 
Pro 
Pro 
805 


2514 




aaa 
Lys 
Lys 


tct 
Ser 
Ser 


caa 
Gin 
Gin 


ggg 

Gly 
Gly 


caa 
Gin 
Gin 
810 


tgg 
Trp 
Trp 


cat 
His 
His 


aac 
Asn 
Asn 


aat 
Asn 
Asn 


agt 
Ser 
Ser 
815 


tac 
Tyr 
Tyr 


tat 
Tyr 
Tyr 


gtt 
Val 
Val 


ctt 
Leu 
Leu 


att 
He 
He 
820 


tct 
Ser 
Ser 


2562 



gca gaa cat cct ttc eta aac tgg tgt ctt ctt aca aga cct ctg get 

35 



2610 



Ala Glu His Pro Phe Leu Asn Trp Cys Leu Leu Thr Arg Pro Leu Ala 
Ala Glu His Pro Phe Leu Asn Trp Cys Leu Leu Thr Arg Pro Leu Ala 
825 830 835 

caa get tgg gat ctt tea ggt ttt att tec gca gaa ttc eta ggt - ggt 

Gin Ala Trp Asp Leu Ser Gly Phe He Ser Ala Glu Phe Leu Gly Gly 

Gin Ala Trp Asp Leu Ser Gly Phe He Ser Ala Glu Phe Leu Gly Gly 
840 845 850 

tgg caa .agt aag ttc aca< gaa**acfe gga^ga-t^ctg^eaa^egt c age-*ttt -agit=^ 

Trp Gin Ser Lys Phe -Thr -Glu Thir Gly Asp Leu^Glh Arg Ser- Phe>«.Sei?*> 

Trp Gin Ser Lys Phe Thr- Glu Thr Gly Asp^Leu*Gln Arg Ser— Phe^Se*?** 
855 8 60 865- 

aga ggt aaa ggg .tac, aatv gtt tec eta ccg.,ata gg.a^tgt tct* tct caa 

Arg Gly Lys Gly Tyr Asn Val Ser Leu Pro He Gly Cys Ser Ser Glri 

Arg Gly Lys Gly Tyr Asn Val Ser Leu Pro He Gly Cys Ser Ser Gin 

870 875 880 885 

tgg ttc aca cca ttt aag aag get cct tct aca ctg ace ate aaa ctt 

Trp Phe Thr Pro Phe Lys Lys Ala Pro Ser Thr Leu Thr He Lys Leu 

Trp Phe Thr Pro Phe Lys Lys Ala Pro Ser Thr Leu Thr He Lys Leu 

890 895 900 

gee tac aag cct gat ate tat cgt gtc aac cct cac aat att gtg act 

Ala Tyr Lys Pro Asp He Tyr Arg Val Asn Pro His Asn He Val Thr 

Ala Tyr Lys Pro Asp He Tyr Arg Val Asn Pro His Asn He Val Thr 
905 910 915 

gtc gtc tea aac caa^gag^gc^aot^tcg^^ 

" In,. .. GLu w Se"^TM«iiiSe^.,1 1 ev^Se'iasGl^Al-C £s-n 



2658 



2706 



2754 



2802 



2850 



2898 



03 950 



Val Val Ser Asn Gln 4 GLu^Ser^ 
Val Val Ser Asn - Gin Glu, Ser^>Th^ 

920 92V 930j^ 

cgc cac ggt ttg ttty gta^ejia^tc^ 2 94 6- 

Arg His Gly Leu -Phe^a^Gl^ 
Arg His Gly Leu^he^a^Gl-h^ 

935 940 ~ 945 — 

gac act cag gee- ttttfcta^aa<^ 2994 
Asp Thr Gin Ala Ehei*Le,u*Asn#Ty^ 
Asp Thr Gin Ala Phe^Leu^Asn^Ty^ 

955 960 965 



3043 



aca aac cac cga gtg tct aca gga eta aaa tec aca ttt taaaactcta 
Thr Asn His Arg Val Ser Thr Gly Leu Lys Ser Thr Phe 
Thr Asn His Arg Val Ser Thr Gly Leu Lys Ser Thr Phe 
970 975 

agetctgett agagttttct gtagccccgg tegtcttaga atcctctatc catcatcgaa 3103 

gaacttagca atgaaggeca agattctcac tctatgagaa ccccccc 3150 
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Figure 2 

Restriction enzyme analysis of CPN 100630 



Hpyl78III 
Mnll I 

TspRI Sthl32I Bfall 

Taal I Fokl BscGI I Xbal I I Tsp509I 

|| I I I I 1 I t 

CACTGTGGATGTGATATTCGCAGAACCTCCCGTCAAATATACTCTAGATATAGGAAGCAA 



-+ 60 



GTGACACCTACACTATAAGCGTCTTGGAGGGCAGTTTATATGAGATCTATATCCTTCGTT 



Nlalll 
Nspl 

Oral SphI Mnll 

Msel I Msel Mnll Cac8I I MboII I 

II I I I I > I 

ATTACGATTTTAAACCTTATTTAACGACAGGGTTGAGGCATGCCTCTTTCTTTCAAATCT 



61 



-+ 120 



TAATGCTAAAATTTGGAATAAATTGCTGTCCCAACTCCGTACGGAGAAAGAAAGTTTAGA 



Hinf I 
Mnll 
Bfal | 
Plel 1 I 
Ddel I I I 
I I I I 



121 



BsmAI 
Hhal 
Thai 
BseMII I 

AccI CviRI Mwol I I 

| I III 

TCATCTTTTTGTCTACTTGCCTGTTTATGTAGTGCAAGTTGCGCGTTTGCTGAGACTAGA 

+ + + + + + 180 

AGTAGAAAAACAGATGAACGGACAAATACATCACGTTCAACGCGCAAACGACTCTGATCT 

Hpyl88IX 
Tthlllll 
MboII 



HphT 



Dpnl 
Bglll | 
BstYI | 

Hinfl Sau3AI I 

Tfil Eco57I | | 

Tsp509I Mnll I Earl I I I 

I III III 

CTCGGAGGGAACTTTGTTCCTCCAATTACGAATCAGGGTGAAGAGATCTTACTCACTTCA 

181 + + + + + + 240 

GAGCCTCCCTTGAAACAAGGAGGTTAATGCTTAGTCCCACTTCTCTAGAATGAGTGAAGT 



Hpyl88lX 



241 



GATTTTGTTTGTTCAAACTTCTTGGGGGCGAGTTTTTCAAGTTCCTTTATCAATAGTTCC 

+ + + + + + 300 

C TAAAACAAACAAGTTTGAAGAACCCCCGCTCAAAAAG T TCAAGGAAATAGT TAT CAAGG 



CviJI 

Haelll Acll Alul 

EcoO109I | Maell CviJI 

BslI Sau96I I Msel I Mnll I 

III II M 

AGCAATCTCTCCTTATTAGGGAAGGGCCTTTCCTTAACGTTTACCTCTTGTCAAGCTCCT 

3d + + + + + + 360 

TCGTTAGAGAGGAATAATCCCTTCCCGGAAAGGAATTGCAAATGGAGAACAGTTCGAGGA 
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Maelll 
I 



Hhal 
I 



Plel 
Acil | 
Fnu4HI | 
Taul I 
BsmAI I I 
I I I 



Apol 
Tsp509I 
Hpyl88IX MboII| 
Hinfl | Hpyl78III I I 
II III 



ACAAATAGTAACTATGCGCTACTTTCTGCCGCAGAGACTCTGACCTTCAAGAATTTTTCT 

361 + ~+ + ' + + "+ 

TGTTTATCATTGATA6GCGATGAAAGAGGGGGTCTCTGAGA^TGGAAG^FJT€TTAAAAAGA 



420 



m 



DrclII CviJI 

NlalVI Haelll Cjel . 

Bsp24I I I FriU'4'HI I Mriil 

Cjel J I Taul I CjePlI 

CjePIM TaqI Acil | I Bsp24I|| 

III I III I I I 

TCTATAAACTTTACAGGGAACCAATCGACAGGACTTGGCGGCCTCATCTACGGAAAAGAT 

421 + + + + + + 480 

AGATATTTGAAATGTCCCTTGGTTAGCTGTCCTGAACCGCCGGAGTAGATGCCTTTTCTA 

Cjel 
Bsbl 

Dpnl Taal I 

Sau3AI | Bpml | I 

MboII I I Pflll08I I I I 
I II I I I I 

ATTGTTTTCCAATCTATCAAAGATTTGATCTTCACTACGAACCGTGTTGCCTATTCTCCA 

481 + + + + + + 540 

TAACAAAAGGTTAGATAGTTTCTAAAGTAGAAG>T-GATGCTTGGCAeAAGGGATAAGAGGT 

Maelll 
Sfcl 
Faul 



Maell 
SfaNI. I 
Maelll ' I I 



Sthl32I 
Acil I 
Mwol | I 



4=1= 



CviJI 



I 



I 



I I 



II II III I 

GCATGTGTAACTAGG.TCGGCAAG»TCCCGCAA*B(SACT»ACAGTAACTAGAGGAGGCTCTGGT 

541 CGTAGACATTGATGCAGCCGTTGAGGGGGTTAGTGATGTCATTGATGTCCTCGG AGACGA 



Mmel Dpnl 

Sfcl TspRI Sau3AI I 

Plel I TaqI | Clal | | 

Mnll | | Hinfl Taal || TaqI | | 

till IN I I I 

CTCCAACCTACAGACTCACTCACTGTCGAAAACATATCCCAATCGATCAAGTTTTTTGGG 

601 + + + + + + 660 

GAGGTTGGATGTCTGAGHGAGTGACAGCTTTTGTATAGGGTTAGCTAGTTCAAAAAAGCe % 

Bce^I^ 

Tsp509I Apo£%. , I 

CviRlX BsaJrI. . Tsp50,9Ir« I 

NlalV CviiJir || Bs^E>SI";; Mse€** | I 

| III I III 

AACCTTGCCAACTTCGGCTCTGCAATTAGCAGTTCTCCCACGGCAGTCGTTAAATTCATC 
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661 



720 



TTGGAACGGTTGAAGCCGAGACGTTAATCGTCAAGAGGGTGCCGTCAGCAATTTAAGTAG 



Acil 
I 



Alul 
CviJI 
Nlalll I 
I 



BsaXI 
Fnu4HI 
Hpyl78III Taul 
Mnll | Acil | 

I I I I 



Mnll 
I 



721 



AATAACACCGCTACCATGAGCTTCTCCCATAACTTTACTTCGTCAGGAGGCGGCGTGATT 

+ + + + + + 

TTATTGTGGCGATGGTACTCGAAGAGGGTATTGAAATGAAGCAGTCCTCCGCCGCACTAA 



780 



Nsil 
HaeIV| 
Hin4I | 
HphI I 
CviRI I | 
MboII | | | Acil 
Hpyl78III I III Fokl | 
Alul Tsp509I I I I I I SfaNI I 

CviJI SfaNI | III) ICjel I I 

| II I I I I I I I I 

TATGGAGGAAGCTCTCTCCTTTTTGAAAACAATTCTGGATGCATCATCTTCACCGCCAAC 

781 + + + + + + 840 

ATACCTCCTTCGAGAGAGGAAAAACTTTTGTTAAGACCTACGTAGTAGAAGTGGCGGTTG 



Maelll 
Tsp45I 
BsaHI 
Cjel 
Mwol 



HphI 
Msel 
Alul | 
CviJI | 
Hgal I I 
! I I 



NlalV 
Hpyl78III I 
BslI I I 
EcoNI | | I 
III I 



rCAACAGCTTAAftAG€€GTCACCCCTTCA^€AGeAACnTATr : CTTTAr , G A^ 



GT- 



841 



AGGACACACTTGTCGAATTTTCCGCAGTGGGGAAGTAGTCCTTGGATACGAAATCCTTCA 



-+ 900 



CviRI 
Sthl32I 
Bed 
CviJI 
NlalV 



Ecil 
Acil ! 
Fokl j | 



BscGI 
SfaNI | 
I I 



Msel 
Tsp509I I 
NspV | t 
TaqI | | 
I I I 



BseSI 
CviRI 
ApaLI I 
I 1 



901 



GGCGGAGCCATCTGCATCCCTACGGGAACTTTCGAATTAAAAAACAATCAGGGGAAGTGC 
CCGCCTCGGTAGACGTAGGGATGCCCTTGAAAGCTTAATTTTTTGTTAGTCCCCTTCACG 



960 



BsiHKAI 
Bspl286I 



Faul 
Sthl32I i 
Rsal I I 
SfaNI | | 
I I 



Mwol 
Dpnl | 
Acil Sau3AI I I 
I I I 



CviRI 



961 



ACCTTCTCTTATAATGGTACACCAAATGATGCGGGTGCGATCTACGCCGAAACCTGCAAC 
TGGAAGAGAATATTACCATGTGGTTTACTACGCCCACGCTAGATGCGGCTTTGGACGTTG 



1020 
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Pflll08I 
BspMI I 
I I 



NlalV 
BanI 
ScrFI 
BsaJII 
Drdlll I 
EcoRII I I 
NlalVI I | 
MM 



Bfal 



PstI 
TspRI 
Fnu4HI | 
CviRI | | 
Tsell I 
BtsI | | | 
Sfcl I | | 
Bsbl | Ml 
I I Ml 



NlalV 
Ecil 1 
Acil I | 
Bbvl | | | 
I I t I 



1021 ■—■ ■ — +— — -+--- — — - — -1080* 

TAGCA^T<§CGT4EGGTCCCACGGAA€GAGGATCTA$@GTTGT^^^ 

AvaM Hp«y*l*78r>I.I 
Sau&6>I*«t ApofeE*: 
Sspl Acil | EcoRI 

Thai I Tsp509I Bfal I 
Tthllll IHgal | Xbal | | Acil 

til I M 

GCCATCTGTGCTAAAGTGCTCAATATTCAAGGACGCGGTCCTATTGAATTCTCTAGAAAC 

1081 + + + + + + 1140 

CGGTAGACACGATTTCACGAGTTATAAGTTCCTGCGCCAGGATAACTTAAGAGATCTTTG 



Bed 
CviJI |MwoI 
I I 



BsiHKAI 
Bspl286I 
I 



NlalV 
CviJI | 

Mmel Haellll BslI 

Alul | EcoO109lM Bsal I Mnll 
CviJI I Sau96I I IBsmAI | SimI 
II II I t I I I I 

CGCGCGGAGAiRGGG^GG:^ 

GCGCGCS-T<3rTO ! eeeA@^ 



Acil 
Hhal 
Thai 
Thai I 



Hpy^8.8''KC^Vr 



BsaJ^Iv^ 



-Hphl J 



Nl<a-HI.: 
Nsptfj* 



ej^i-^ri- 



I II II II 

ACATGGA©A<&T*TA@GATfTtT^ 
1201 — = - + ^ r ..--~--^+^-. Tr ^-.^-^---^. — + 12 60 



Hinfl 
Tfil 
ScrFI I 
EcoRI I I I Acil 
III I 



Cjel TspRI 
Bed I Taal I 
BsrDIISfcIl I 
II II I 



Hin4I 
Hin4I | 
BsaXI I | 
Bsgl I | I 
till 



1261 



AATACAAAACCTGGAATCCGCAATGCCATCACTGTAGAAGCAGGGGGAGAGATTGTGTCT 

+ + + + + + 1320 

TTATGTTTTGGACCTTAGGCGTTACGGTAGTGACATCTTCGTCCCCCTCTCTAACACAGA 



Dpnl 

CviRI Maell Sau3AI | CviJI 

BsmAI Mnll CviJI | Alwl I | BsaXI | 

t I II III M 

CTATCTG©ApAAGGAGG<^^^^ 

GATAGA@GTGT*re<50!:eCG^^ 

Alwl* 
Ha%£$pft*| 
Hin4I | 
Sfcl | | 
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1381 



AlwNI 
Dpnl 
BstYI 
Sau3AI 
Hpyl78III 
Haell I 
Hhal I I 

Mnll Eco57I Mwol | I I 

| I lit! 

ACCACAAGTCCGTCTAATAAAGACATTACAATCAACGCTAATGGCGCTTCAGGATCTGTA 

. + + + + + + 1440 

TGGTGTTCAGGCAGATTATTTCTGTAATGTTAGTTGCGATTACCGCGAAGTCCTAGACAT 



BseRI BsmFI 

PlellHinfl Sfcl I Mnll Cac8I Bsbl 

III III II 

GTCTTTACAAGTAAGGGACTCTCCTCTACAGAACTCCTGTTGCCTGCCAACACGACAACT 

1441 + + + + + + 1500 

CAGAAATGTTCATTCCCTGAGAGGAGATGTCTTGAGGACAACGGACGGTTGTGCTGTTGA 



in 



Dpnl 
Sau3AI 
Hpyl78III 
Taal 1 
CjePI I I 
Bfal II II IBfal 

I 1 



Eco57I 
Acil I 
CjePI MboII I 1 
I III 



ATACTTCTAGGAACAGTCAAGATCGCTAGTGGAGAACTGAAGATTACTGACAATGCGGTT 



1501 



-+ 1560 



TATGAAGA T€eCTg^H ^£TTCTAGCGftTrA^^^ 



Hpyl78III 
Banll 
Bspl286I 
CviJI 
Mnll 



Banll 
Bspl286I 
Mwol BpulOI) 
CviJI I Ddell 
Cac8l I I Ddel CviJI | | BseMII I 
III I Ml II 



ScrFI 
Bsa JI 
BsaJI I 
BseMII | 
EcoRIII 
Alul I I 
CviJI I I 
I I 
I I 



NlalV 
I 



GTCAATGTTGCTGGCTTCGCTACTCAGGGCTCAGGTCAGCTTACCCTGGGCTCTGGAGGA 

1561 + + + + + + 1620 

CAGTTACAACGACCGAAGCGATGAGTCCCGAGTCCAGTCGAATGGGACCCGAGACCTCCT 



AccI 
BspMI 
Sfcl | 
MspAlII 1 
Acil I I I 
Fnu4HI | | I 
Taul I I I 
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Mwol 
TaqI I 
Aarl 
BsiHKAI 
Bspl286I 
BscGI 
BsaJI 
BstDSI 
BsbJ^ 
BpimE. 
S t hi 32,1;, 
Bsu86I Cac8I I 
Dde s I«*^CviJI I 1 
I lit 

AC^1^T7?G GGG^GGG^kftSAGG CAGGGGAGGAG© TGGS G GTG T'AGAGvRT TAGGAT*»UGG!AAftGi^ 

1621 + + + + + + 1680 

TGGAATCCCGACCGTTGTGGGTGCCCTCGTGGACGGCGACATCTGAAATGCTAACCTTTC 



1681 



Dpnl 
Sau3AI | 

TaqI | | Cac8I 
Alwl | | | CviRI | 

Bsml I I I SfaNI I I 

- I III III 
TTAGCATTCGATCCTTTTTCCTTCCTAAAAAGAGATTTTGTTTCAGCATCAGTAAATGCA 

+ _+_-._-__; + + + 1740 

AATCGTAAG&TAGGAAAAAGGmGGATTTTTCTCTAAAAGA^GT ~ 



Hpyl78Iir 
Drdll; 
Alo A Iv-„, 

Ppi^^ : 

Bafi^iDeir. 



BspU^8 6T^, 
MaeM*Tfc Sac'I-^ 
Tsp^SI ■- Alu-I-- I I 

Mae I-I*| Mse***** CviJ*I | | 
II I I I I 

GGCACAAAAAACGTCACTTTAACAGGAGCTCTGGTTCTTGATGAACATGACGTTACAGAT 

1741 + — ■ + + + + + 1800 

CCGTGTTTTTTGCAGTGAAATTGTCCTCGAGACCAAGAACTACTTGTACTGCAATGTCTA 



Dpn*I~ 

MaW^^Sgl^I-. - 1 
MaeOsI | Bst« | 
Nlalll I |Sai*3AI*.i | 
III II 



Bpml 

HaelV Tsp509l 
Hin4I BsrI I 

I II 
CTTTATGATATGGTGTCATTACAATCTCCAGTAGCAATTCCTATCGCTGTTTTCAAAGGA 

1801 + + + + + + 1860 

GAAATACTATACCACAGTAATGTTAGAGGTCATCGTTAAGGATAGCGACAAAAGTTTCCT 



Maelll 
Taal 
BsaXI I Ddel** 
I I 



Hinfl 

Bee I " Hi n 4 1*^| 
Hpylp8/I^I^; I Plel^ I i 
I III 



CviJI 

Mwo»Imm^ 
Cjel^l 
CviJI | | : 

I I I 



GCAACCGTTXCTAAGAGAGGATTTCS^ 

1861 — + + + + + + 1920 

CGTTGGCAATGATTCTGTCCTAAAGGACTACCCCTCTAACGCTGAGGTTCGGTGATGCCG 
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Tsp509I 
Sthl32I 
BscGI 



BsaJI 
Styl 
1 



BsmFI 
Avail 
Sau96I 
Bcefl I 
BsmFI I 
I I 



Cjel 
NlalV 
Avail 
Nlalll 
Sau96I 
BslI I 
I I 



Msel 



Acelll 
BsmFI 
BslI 
Bed 
EcoNI 
Hpyl7 8III 
AlwNI | 
Mnlll 
Alul I I 
CviJI | I 
I I I 



TACCAAGGAAAGTGGTCCTACACATGGTCCCGTCCCCTGTTAATTCCAGCTCCTGATGGA 

1921 + + + + + + 1980 

ATGGTTCCTTTCACCAGGATGTGTACCAGGGCAGGGGACAATTAAGGTCGAGGACTACCT 





NlalV 






lP 


Avail I 








EcoOl09I | 






==? I 


Psp5II | 








Sau96I I 


Bpml 


Hpyl8 8IX 


rn 


ScrFI 1 1 


Hhall 


Apol i 


3 


EcoRII I 1 1 


Mnlll | 


EcoRI | 




Mnll II II 
III 1 =t= 


Bfal I | I 
1 III 


Tsp509I 1 
1 1 



GGATTTCCTGGAGGTCCCTCTCCTAGCGCAAATACTCTCTATGCTGTATGGAATTCAGAC 

1981 + + + + + + 2040 

CCTAAAGGACCTCCAGGGAGAGGATCGCGTTTATGAGAGATACGACATACCTTAAGTCTG 

Sthl32I 
Maelll 
Hpyl78III 
Aval 
Dpnl 



BssSI 
MslI 
I 



BstYI 
Sau3AI 
Ddel I 
Alwl I | 
I I I 



Cjel 
BsaXI | 
Hin4I | 
Tsp509I| I 
I [ I 



ACTCTCGTGCGTTCTACCTATATCTTAGATCCCGAGCGTTACGGAGAAATTGTCAGCAAC 

2041 + + + + + + 2100 

TGAGAGCACGCAAGATGGATATAGAATCTAGGGCTCGCAATGCCTCTTTAACAGTCGTTG 

Alul Hpyl88IX 
CviJI Ddel Cjel BsmI I FoJcI 

I III! I 

AGCTTATGGATTTCCTTCTTAGGAAATCAGGCATTCTCTGATATTCTCCAAGATGTTCTT 

2101 + + + + + + 2160 

TCGAATACCTAAAGGAAGAATCCTTTAGTCCGTAAGAGACTATAAGAGGTTCTACAAGAA 
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Dpn* 
Ha-e&VI 
Hin v 4I| 
Sau3$£«M 
Sthl32I | IBsaJII | 
Ml III 



S£hl32I 
Nciiw 

Smal 
Msp*I I 
Neil I 
ScrFI | 
Ava*Iv| I 



2161 



Alul 
CviJI 
Hindi I I | 
Thfaffj* I I 
Acil | I I 
II II 

TTGATAGATCATCCCGGGTTGTCCATAACCGCGAAAGCTTTAGGAGCCTATGTCGAACAC 

+ + + + + + 2220 

AACTATCTAGTAGGGCCCAACAGGTATTGGCGCTTTCGAAATCCTCGGATACAGCTTGTG 



Bs-am- 
Hrn.4'1-* 
CviJI | 
Nl*aJVT- | 
Mwol | | | TaqI 
I I I I I 



y i 

m 



Fnu4HI 
CviJI Alul I 

Mnll CviJI Mwol | CviJI | 

BslI | Nlalll I Mnll Bbvl | | TseI|HhaI 

II II I I I I III 

ACACCAAGACAAGGACATGAGGGCTTTTCAGGTCGCTATGGAGGCTACCAAGCTGCGCTA 

2221 — + + + + + + 2280 

TGTGGT^G&GT^CCT 



Avail.. , 
Sau^p«*;; 
Cjelv I 



Aluil, 
CviJI 
Fnu-4iH»I^,i 

Cjr^Mljl I 
I I 



m 



Mae^I*I$iSth*l<fS^^r Cjre^Xqgl: I I Bb&W 
III I I <M I I 

??0 1 --- — ^^~^^^~r— 23 4-Q- 

Hi«frf«Ef** 
Tfil , Hpy,1.88*X. 
Pflll08I |MaeII Ddel \ BseMII 
III II I 

ACTAACGCCAACCCCTACGATTCACGTTGCTCAGAACAAATGTATTTACTCTCGTTCTTT 

2341 + + + + + + 2400 

TGATTGCGGTTGGGGATGCTAAGTGCAACGAGTCTTGTTTACATAAATGAGAGCAAGAAA 



Tsp509l 
I 



Hinfl 
Hpyl78III 
Maellll 
Tsp45I| 

Plel-.. | | 
I I I 



Mnll_ 



ScrFI 
EcoRII | 



Tsp509I 
Msel 
CviJI 
Hael 
Haelll 
Stu-I- 
I 



I 



Alul 
CviJI 
Fnu4HI | 
TselJ I 
I I I 



GGTCAAT*T\G@S'TA$(§JG 
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Alul 
CviJI 
BseMIII 

HphI Hpyl88IX Bce83I | | 

Bbvl I Ddel | Mnlll II Smll 

I 1 II II II 1 

GGTTATTCCAAAAATCACCTAAATACCACCTACCTCAGACCTGACAAAGCTCCAAAATCT 

24 61 + + + + + + 2520 

CCAATAAGGTTTTTAGTGGATTTATGGTGGATGGAGTCTGGACTGTTTCGAGGTTTTAGA 

PstI 
CviRI | 

BsrDI Maelll Fokl Sfcl | I Alol 

I ' I I I I I I 

CAAGGGCAATGGCATAACAATAGTTACTATGTTCTTATTTCTGCAGAACATCCTTTCCTA 

2521 + + + + + + 2580 

GTTCCCGTTACCGTATTGTTATCAATGATACAAGAATAAAGACGTCTTGTAGGAAAGGAT 



Dpnl 
BstYI 
Sau3AI 
Alul 
CviJI 
Hindlll 
Mnll 

Bbsl Smll I 

MboII BsrI Bce83I CviJI | II II Alwl Acil 

|| I I I I I I I I I 

AACTGGTGTCTTCTTACAAGACCTCTGGCTCAAGCTTGGGATCTTTCAGGTTTTATTTCC 

2581 + + + + + + 2640 

TTGACCACAGAAGAATGTTCTGGAGACCGAGTTCGAACCCTAGAAAGTCCAAAATAAAGG 



03 



Bfal 
Avrll I 
BsaJI I 



Apol 



Maell 
CviRI 
Dpnl 
Bglll 
BsrI 
R s1 ~YT 



AlnT 



EcoRI I I Sau3AI | I | CviJI 

Tsp509I Styll Cjel AlwNI I I I ICjell 

| || I I I I I I II 

GCAGAATTCCTAGGTGGTTGGCAAAGTAAGTTCACAGAAACTGGAGATCTGCAACGTAGC 

2641 + + + + + + 2700 

CGTCTTAAGGATCCACCAACCGTTTCATTCAAGTGTCTTTGACCTCTAGACGTTGCATCG 

Bpml Drdll 
Mnlll Rsal MboII Fokl I 

II I I II 

TTTAGTAGAGGTAAAGGGTACAATGTTTCCCTACCGATAGGATGTTCTTCTCAATGGTTC 

2701 + + + + + + 2760 

AAATCATCTCCATTTCCCATGTTACAAAGGGATGGCTATCCTACAAGAAGAGTTACCAAG 



NlalV Bed BsaBI 

Msel CviJI | TspRII CviJI EcoRV | 

I II II III 

ACACCATTTAAGAAGGCTCCTTCTACACTGACCATCAAACTTGCCTACAAGCCTGATATC 

2761 — + + + + + + 2820 

TGTGGTAAATTCTTCCGAGGAAGATGTGACTGGTAGTTTGAACGGATGTTCGGACTATAG 
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2821 



Ddel 
Dpnl 

Maelll Sau3AI 
Tsp45I Taqll 
Mnlll PshAI BsmAI BsiHKAI || 

Hindi Sspl || Taall BsmBI Bspl28 6I || 

I III I i I III 

TATCGTGTOAAGCCTCA£^TAT,TGTGACTGTCGT^ 

+ + — + - S -+ J + — - ^<--+ 2880 

ATAG@A(5AGT*TGGGAGEGT-TAT^ 



Hpyl78III 



BsaJI 
BstDSI 
Tthlllll 
Acil | 
Fnuv4HI*| 
BseMII [ | 
Cjel |TauI| 
II II 



Dpnl 
Bg>l*I«I 

Rsal BstYI 
BsrGI I Cjel Sau3AI 

TatI | Nlalll HphI | 

I I 



ITaal 

I I 

TCAGGAGCAAATCTACGCCGCCACGGTTTGTTTGTACAAATCCATGATGTAGTAGATCTC 
2881 + + + + + + 

AGTCCTCGTTTAGATGCGGCGGTGCCAAACAAACATGTTTAGGTACTACATCATCTAGAG 



2940 



en 



m 



CviJI 

Hael Bsp24I 

Haelll Sthl32I BscGI CjePI 

BsaJI Ddel StuI BseMII | BslI | Cjel| 

1 • ■ Jrts%:> i II \ I II 
ACCGAGGACACTCAGGCCTT^TCTAAAGTATACCT^^ 
2941 -r — — -+--,«- „^_+„, + 3000 

TGGCTCGTGTG^GTCeGGAAAGATvTTO 

Sfel Cjel u Alul^ 

Acel | CjeP^to Dral<^ CviJI ♦ 

Dralll ||?- Bsp24*l3l4k MsefrW Ddel | Ddel- 

I I I II II I I I 

CACCGAGTG^CTAeAGGACT^ttftAATC 



GTGGCTCACAGfttfGT(5CT'GXT*T-fra 



Hinfl 
Ddel | 
Sthl32I 
BsiEI 
Mspl 

Neil lit | 
ScrFI •: III | 
BsaJI- | 
CviJI | * | 
Sf cl | | | 

I III 



Taq*I** 
MnMfc' | 
Bcetf^l | , 
I I f 

TCTGTAGCCeGGGECGTGKTAG/^^ 

3061 + + +- + + ■- + 3120 

AGACATCGGGGCCAGCAGAATCTTAGGAGATAGGTAGTAGCTTCTTGAATCGTTACTTCC 



CviJI 
Hael 
Haeirll 
Bs^E)!^! 

DdeI^MboM-v v * < | | | 
I I III 



46 



RY47 101998 



Hin4I 
Hinfl I 
Tfil I 
I I 

CC AAG AT T C T CAC T CT AT G AGAACCCC C CC 

3121 + + + 3150 

GGTTCTAAGAGTGAGATACTCTTGGGGGGG 



r— s 



WM 
ft* 
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